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models.   However,  prior  knowledge  of  forecast  errors  from  a 
quarterly  autoregressive  model  provides  better  predictions  of 
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This  result  is  inconsistent  with  previous  research,  and  is 
anomalous  given  analysts'  greater  accuracy. 
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1 .   Introduction 


Analysts'  forecasts  of  earnings  are  increasingly  used  in 
accounting  and  finance  research  as  expectations  data,  to  proxy 
for  the  unobservable  "market"  expectation  of  a  future  earnings 
realization.   Since  a  diverse  set  of  forecasts  is  available  at 
any  time  for  a  given  firm's  earnings,  composites  are  used  to 
distill  the  information  from  the  diverse  set  into  a  single 
expectation.   This  paper  considers  the  relative  merits  of  several 
composite  forecasts  as  expectations  data.   The  primary  result  is 
that  the  most  current  forecast  available  is  more  accurate  than 
either  the  mean  or  the  median  forecast.   This  result  is 
consistent  with  the  hypothesis  that  individual  forecasters 
incorporate  information  from  others'  previous  predictions  into 
their  own.   Further,  it  suggests  that  the  forecast  date,  which 
previous  research  has  largely  ignored,  dominates  i d i osyncrac i es 
of  individual  forecasters  as  a  characteristic  relevant  for 
distinguishing  among  forecasts. 

The  second  contribution  of  this  paper  is  a  comparison  of 
analysts  with  time  series  models,  which  are  a  competing  source  of 
earnings  expectations.   Consistent  with  previous  research,  I  find 
that  analysts  generally  are  more  accurate  than  time-series 
models.   However,  I  find  that  prior  knowledge  of  the  forecast 
errors  from  a  simple  au toregress i ve  model  on  the  univariate 
series  of  quarterly  earnings  provides  better  predictions  of 


excess  stock  returns  than  prior  knowledge  of  analysts'  forecast 
errors.   This  result  is  inconsistent  with  prior  research,  and 
somewhat  anomalous  given  analysts'  greater  accuracy. 

A  third  contribution  of  this  paper  is  a  methodological 
refinement  of  the  techniques  used  to  evaluate  forecasts..   I 
demonstrate  the  existence  of  significant  t i me -per i od -s pec i f i c 
effects  in  forecast  errors.   If  time-series  and  cross -sec t i on 
data  are  pooled  without  taking  these  effects  into  account,  the 
statistical  results  may  be  overstated,  and  the  results  are 
subject  to  an  aggregation  bias.' 

In  section  2,  I  describe  proxies  for  consensus  in  analysts' 
forecasts,  as  well  as  two  quarterly  time-series  models  used  in 
the  empirical  tests.   In  section  3  I  describe  the  data.   The 
statistical  tests  and  results  are  discussed  in  section  4,  and 
section  5  is  a  summary  with  some  concluding  remarks. 


2j___Prox_ies_.for_Exp.ect.ed_  Earnings 


2.1   Defining  Consensus  for  Analysts'  Forecasts 


The  motivations  for  seeking  a  consensus  expectation  when 
many  forecasts  are  available  are  primarily  practical.   First,  if 
individual  forecasts  contain  idiosyncratic  error  which  can  be 
diminished  or,  ideally,  completely  diversified  away,  then  more 
accurate  forecasts  can  be  obtained  by  combining  forecasts  from 


different  sources  into  a  single  consensus  number.1   Second,  in 
many  contexts  the  earnings  expectation  is  not  the  central  issue. 
but  is  a  necessary  piece  of  data.   For  example,  a  measure  of 
unanticipated  earnings  is  necessary  to  adjust  for  effects  of 
simultaneous  earnings  releases  in  measuring  non-earnings  events. 
Reducing  the  measurement  error  in  anticipated  earnings,  and 
therefore  in  unanticipated  earnings,  increases  the  power  of  tests 
of  the  non-earnings  event. 

Academic  researchers  have  used  a  variety  of  methods  to 
aggregate  analysts'  forecasts  into  a  single  expectation. 
Barefield  and  Comiskey  (1975),  and  Fried  and  Givoly  (1982)  use 
the  mean  of  a  set  of  forecasts.   Brown  and  Rozeff  (1978)  use  a 
single  forecast  from  ValueLine.   Givoly  and  Lakonishok  (1979) 
select  the  "most  active"  forecaster  for  each  firm  from  those 
available  in  Standard  and  Poors'  Earnings  Forecaster.   Elton, 
Gruber,  and  Gultekin  (1981)  and  Brown,  Foster  and  Noreen  (1984) 
consider  both  means  and  medians  published  in  the  I/B/E/S  Summary 
database.   Brown,  Griffin,  Hagerman  and  Zmijewski  (1985)  combine 
a  ValueLine  forecast  with  several  time-series  forecasts.   I 
compare  three  composites  from  a  set  of  available  forecasts:   the 
mean,  the  median,  and  the  most  current  forecast. 

An  implicit  assumption  behind  the  use  of  either  the  mean  or 
the  median  forecast  to  represent  the  consensus  is  that  all 
forecasts  are  current,  so  that  cross-sectional  differences  in 
forecasts  are  attributable  to  differential  use  of  the  same  global 
set  of  information.   Gains  from  combining  forecasts  arise  either 


from  the  employment  of  more  information  in  the  aggregate  than  is 
used  by  any  individual,  or  from  diversification  across 
individuals'  idiosyncratic  errors. 

In  fact,  however,  the  analysts'  forecasts  available  at  any 
time  have  been  produced  at  varying  dates.   If  analysts 
incorporate  information  from  others'  forecasts  in  producing  their 
own,  then  more  current  forecasts  may  be  more  accurate,  and 
possibly  more  representative  of  a  "market"  expectation.   By 
examining  the  most  current  forecast  as  a  definition  of  consensus, 
I  apply  the  extreme  form  of  this  argument,  providing  a  comparison 
of  the  relative  importance  of  forecast  age  and  individual 
forecast  error.   If  diversifying  across  individual  i d i osyncrac i es 
is  more  important  than  eliminating  out-dated  forecasts,  then  the 
single  most  recent  forecast  will  not  necessarily  be  more  accurate 
than  aggregations  of  many  forecasts. 

Using  the  most  current  forecast  as  a  consensus  expectation 
addresses  the  capital  markets  context  in  which  forecasts  are  most 
likely  used.   The  implicit  assumption  is  that,  since  information 
is  disseminated  relatively  rapidly  in  these  markets,  recent 
releases  have  more  value  than  previous  ones.   Thus,  the  most 
current  forecast  has  an  observable  characteristic  which  is 
presumed  to  have  value.   Its  value,  however,  depends  upon  how  the 
forecast  is  used  by  investors  and  others  in  decisions.   The 
process  by  which  earnings  information  is  translated  into  prices 
or  other  outcomes  is  far  from  we  1 1 -under s tood .   I  do  not 
undertake  the  task  of  describing  this  process,  but  assume  that 


one  exists,  and  that  it  results  in  more  current  information 
having  higher  value. 

2.2   Quarterly  Time-series  Models  of  Earnings 


I  use  quarterly  time-series  models  of  earnings  as 
benchmarks,  against  which  analysts'  forecasts  are  compared. 
Time-series  models  have  been  used  frequently  in  previous  research 
to  provide  earnings  expectations.  [Analysts ,  however,  have  the 
advantage  of  a  much  broader  information  set  than  is  employed  in  a 
univariate  model,  including  industry  and  firm  sales  and 
production  figures,  general  mac roeconom i c  information,  and  other 
analysts'  forecasts,  in  addition  to  the  historical  series  of 
earnings.   Analysts'  forecasts,  therefore,  seem  likely  to  be  more 
accurate  than  forecasts  from  univariate  models.  J 

Several  studies  (Brown  and  Rozeff  (1978),  Collins  and 
Hopwood  (1980),  Fried  and  Givoly  (1982))  demonstrate  that 
analysts  are  more  accurate  than  univariate  models,  presumably 
because  of  the  broader  information  set  they  can  i ncorpora te . S 
Fried  and  Givoly  (1982)  also  find  that  analysts'  forecast  errors 
are  more  closely  associated  with  excess  stock  returns  than  are 
those  of  univariate  models.   Nonetheless,  univariate  models 
remain  a  common  means  of  generating  earnings  expectations. 

An  advantage  of  univariate  time-series  models  is  the 
relative  ease  with  which  earnings  data  can  be  obtained  for 
moderate  samples  of  firms.   This  advantage  is  tempered  by  the 
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caveat  that  the  data  requirements  of  the  models  impart  a 
"survivorship"  bias  to  samples.   Another  advantage  to  time-series 
models  is  the  relative  simplicity  of  the  models  used  to  generate 
expectations.   Parsimonious  models  with  a  single,  simple  ARIMA 
structure  applied  to  all  firms  have  been  shown  to  predict  at 
least  as  well  as  univariate  models  with  more  complex, 
individually-specified  structures,  when  one -s tep-ahead  forecast 
errors  are  compared. 

I  use  the  following  two  quarterly  time-series  models,  from 
Foster  ( 1977 ) : 


E  [  a  _    =a_     +e.-  +  e.^(a.i   ,,  -  a  . .   ^  ) 
jtqJ      Jtq-4      JO      jlv  Jtq-l      jtq-5 


and  : 


E[a  ..  ]  =  a  ..   .  +  e._ 

jtqJ      Jtq-4      j2 


(1  ) 


2) 


where  aj^g  denotes  quarterly  reported  earnings  for  firm  j  in 
quarter  q  of  year  t,  and  6  j  o  ■  6  j  l  and  9  j  2  are  estimated 
parameters.   The  models  are,  respectively,  a  first  order 
autor egress ive  process  in  the  fourth  differences  with  a  drift, 
and  a  random  walk  in  the  fourth  differences  with  a  drift.   I 
chose  these  particular  models  because  of  their  relative 
simplicity,  and  because  they  have  proven  to  be  at  least  as  good 
as  other  mechanical  quarterly  models.   The  data  used  and  the 
estimation  of  these  models  are  described  in  section  3. 


3 .   Data 


3.1   Sources 


The  forecast  data  are  from  the  Institutional  Brokers 
Estimate  System,  or  I/B/E/S,  a  database  developed  by  the  Lynch. 
Jones  and  Ryan  brokerage  house.   The  database  consists  of 
forecasts  made  between  early  1975  and  mid-1982,  by  analysts  at 
between  50  and  130  brokerage  houses.   Between  1000  and  2500 
firms'  EPS  are  forecast,  depending  on  the  month  and  year  in 
question.   The  analysts  and  brokerage  houses  are  identified  by 
code  numbers.   Each  brokerage  house  employs  many  analysts,  but  at 
most  one  forecast  is  reported  from  each  brokerage  house  at  any 
given  time,  for  any  given  firm  and  year.   The  data  used  in  this 
paper  are  the  individual  analysts'  forecasts,  and  their 
associated  forecast  dates. 

Lynch,  Jones,  and  Ryan  update  the  database  on  or  around  the 
25th  of  each  month.   From  the  updated  list  of  forecasts,  summary 
statistics  such  as  the  mean,  the  median,  the  standard  deviation, 
the  number  increasing,  etc. ,  are  computed.   The  summary 
statistics  are  then  sold  to  clients,  primarily  institutional 
investors . 2 

C0MPUSTAT  is  the  source  of  earnings  data  and  most  earnings 
announcement  dates.   The  remaining  announcement  dates  are  from 
the  Wall  Street  Journal  (WSJ)  and  its  Index.   Stock  return  data 
and  the  trading  day  calendar  are  from  the  CRSP  Daily  Returns 
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file.   Data  on  stock  splits  and  stock  dividends  are  from  the  CRSP 
Monthly  Master  file. 

The  initial  sample  comprises  the  set  of  firms  in  the  I/B/E/S 
database  with  December  yearends,  and  with  forecasts  available  for 
each  year  from  1975  through  1981.   This  set  contains  508  firms, 
and  3556  firm-years.   Firm-years  are  excluded  if  the  annual 
earnings  are  not  available  on  COMPUSTAT,  or  if  all  four  quarterly 
earnings  announcement  dates  are  not  available  from  COMPUSTAT  or 
the  WSJ.   This  criterion  reduces  the  sample  to  457  firms,  with 
3440  firm-years.   The  estimation  requirements  of  the  quarterly 
models,  30  continuous  quarters  of  data  prior  to  1975-IV,  further 
reduce  the  sample  to  190  firms,  with  1284  firm-years.   The  final 
requirement,  listing  on  CRSP,  reduces  the  sample  to  184  firms 
with  1260  firm-years. 

Analysts  occasionally  produce  forecasts  for  fully-diluted, 
rather  than  primary,  EPS.   Primary  EPS  are  earnings  divided  by 
the  average  number  of  outstanding  common  shares,  including  common 
stock  equivalents  such  as  stock  options  and  warrants.   Fully- 
diluted  EPS  are  earnings  divided  by  the  average  number  of 
outstanding  common  shares,  computed  "to  show  the  maximum 
potential  dilution  of  current  EPS"  (APB  Opinion  No.  15),  and 
include  in  the  denomimator  securities  such  as  contingently 
issuable  common  stock.   I  convert  forecasts  of  fully-diluted  EPS 
to  primary  EPS,  using  the  ratio  of  fully-diluted  to  primary  EPS 
for  that  firm  and  year  from  COMPUSTAT.   I  also  adjust  for  stock 
splits  and  stock  dividends  that  were  announced  between  the 
forecast  date  and  the  annual  earnings  announcement  date. 


3.2   Measuring  Forecast  Errors 

Forecast  errors  are  the  elementary  data  I  use  to  evaluate 
forecasts.   The  forecast  error  e^jtT  is  defined  as  the  difference 
between  Ajt,  actual  earnings  per  share  (EPS)3  of  firm  j  in  year 
t,  and  fjjtT'  t'ie  forecast  of  EPS  from  source  i,  at  a  horizon  t 
prior  to  the  realization: 


e      =  A    -  f 
ijtx     jt     ijtT 


(3) 


The  source  of  the  forecast,  denoted  by  i,  is  one  of  the 
following:  the  mean,  the  median,  or  the  most  current  of  a  set  of 
analysts'  forecasts:  or  one  of  the  two  benchmark  quarterly  models 
described  in  section  2.2.  The  composite  analyst  forecasts  are 
constructed  from  a  set  of  forecasts  which  includes  the  most 
recent  forecast  by  each  individual  in  the  database,  prior  to  a 
fixed  horizon  date.4 

Forecasts  for  each  firm  and  year  are  selected  at  five  fixed 
horizons  of  less  than  one  year  in  duration.   They  are:   240,  180, 
120,  60  and  5  trading  days  prior  to  the  announcement  of  annual 
earnings.   The  first  four  horizons  correspond  roughly  to  dates 
following  each  of  the  year's  quarterly  earnings  announcements; 
the  fifth  is  immediately  prior  to  the  annual  earnings 
announcement.   For  example,  a  horizon  of  240  trading  days  will 
usually  correspond  to  a  date  after  the  previous  year's  annual 
announcement,  but  before  the  current  year's  first  quarter 
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announcement.   A  horizon  of  180  trading  days  will  typically 
correspond  to  a  date  between  the  first  quarter  announcement  and 
the  second  quarter  announcement;  and  so  on. 

The  motivation  for  selecting  horizons  corresponding  to  dates 
in  different  fiscal  quarters  is,  first,  to  observe  whether  there 
is  an  evolution  of  expectations  over  the  course  of  the  year,  and 
second,  to  ensure  that  the  horizons  differ  in  a  well-defined, 
observable,  and  potentially  relevant  respect:   the  amount  of 
quarterly  earnings  data  available.5 

Given  a  horizon  T,  firm  j,  and  year  t,  the  selected 
forecasts  are  the  most  recent  available  from  each  brokerage  house 
issuing  a  forecast  for  firm  j  in  year  t.   I  use  the  dates 
assigned  to  forecasts  by  analysts,  not  the  dates  of  the  I/B/E/S 
file  in  which  they  first  appear.   The  lag  between  the  analyst's 
date  for  a  forecast  and  the  date  of  its  first  appearance  on 
I/B/E/S  averages  34  trading  days,  and  has  a  standard  deviation  of 
44.5  trading  days.   Thus,  some  recently-updated  forecasts  are 
omitted  from  each  monthly  listing  by  I/B/E/S.   Since  published 
summary  statistics  are  computed  from  these  monthly  lists,  the 
summary  statistics  also  fail  to  reflect  some  recent  updates. 

Using  analysts'  dates  instead  of  I/B/E/S  "publication"  dates 
results  in  a  realignment  of  the  data,  with  fewer  omissions  of 
recently  updated  forecasts.   Previous  studies  of  analysts' 
forecasts  have  used  the  publication  date,  not  the  analyst's 
forecast  date,  to  select  forecasts.   For  example.  Fried  and 
Givoly  (1982)  and  Givoly  and  Lakonishok  (1982)  select  their 
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samples  based  on  the  publication  date  of  the  Standard  &  Poors' 
Earnings  Forecaster,  their  source  of  forecast  data.   Fried  & 
Givoly  go  on  to  use  analysts'  dates  within  that  sample  to 
distinguish  new  and  old  forecasts.   Brown  and  Rozeff  (1978)  and 
Brown,  Foster  and  Noreen  (1984)  use  datasets  for  which  individual 
analysts'  forecast  dates  are  not  available.   Using  publication 
dates  instead  of  forecast  dates  probably  biases  results  against 
analysts,  by  failing  to  include  some  recent  updates  of  forecasts. 

From  the  set  of  available  forecasts  at  horizon  T  for  each 
firm  j  and  year  t,  I  compute  the  mean  and  median,  and  find  the 
most  current.   I  use  the  mean,  median  and  most  current  as  proxies 
for  the  analysts'  consensus. 

The  fundamental  difference  between  the  most  current  analyst 
forecast  as  a  consensus  definition  and  either  the  mean  or  the 
median,  is  that  the  former  is  constructed  using  the  forecast 
date,  while  the  latter  two  are  not.   In  Table  1,  I  compare  the 
distributions  of  forecast  ages  for  the  mean,  median,  and  most 
current  forecasts.   The  age  of  a  forecast  is  defined  to  be  the 
difference,  measured  in  trading  days,  between  the  forecast  date 
and  the  horizon  date  chosen  for  this  study.   More  generally,  it 
can  be  considered  the  lag  between  the  forecast  date  and  an  event 
date  of  interest  to  the  researcher.   For  Table  1,  I  define  the 
ages  of  the  mean  and  median  forecasts  as,  respectively,  the  mean 
and  the  median  of  the  ages  of  the  forecasts  in  the  set  of 
available  forecasts  for  each  firm,  year  and  horizon.   The 
distribution  described  in  Table  1  is  over  all  firms  and  years, 
for  each  horizon. 
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As  expected  from  its  definition,  the  most  current  forecast 
has  a  distribution  of  ages  much  closer  to  zero  than  either  the 
mean  or  median.   For  the  four  longer  horizons  (240,  180,  120,  and 
60  trading  days),  over  fifty  percent  of  the  most  current 
forecasts  are  less  than  five  trading  days  old.   By  contrast,  over 
fifty  percent  of  the  mean  or  median  ages  at  all  horizons  are  more 
than  forty  trading  days  old.   While  some  of  these  older  forecasts 
may  represent  circumstances  where  little  new  information  has 
arrived,  so  there  was  no  need  to  update,  the  accuracy  results 
which  follow  suggest  that  this  is  not  always  the  case. 

The  quarterly   models  (1)  and  (2)  are  estimated  for  each 
firm,  for  each  quarter  from  1975-1  through  1981-IV.   Parameter 
estimates  are  updated  each  quarter,  using  the  previous  thirty 
quarters'  observations.   Observations  are  adjusted  for  changes  in 
the  number  of  outstanding  shares.   Annual  forecasts  are 
constructed  from  quarterly  forecasts  by  summing  the  appropriate 
realizations  and  forecasts.   For  example,  in  quarter  3,  there 
have  been  two  realizations  of  quarterly  earnings  for  the  year, 
and  two  quarters  remain  to  be  forecast.   The  annual  forecast  from 
a  quarterly  model  during  the  third  fiscal  quarter  of  the  year  is 
the  sum  of  actual  earnings  for  quarters  1  and  2,  and  forecasts 
for  quarters  3  and  4. 

Because  of  a  small  number  of  untraceable  influential 
observations  which  altered  the  regression  results,  I  imposed  an 
arbitrary  censoring  rule  on  the  data  as  well:   all  forecast 
errors  larger  than  $10.00  per  share  in  absolute  value  were 
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deleted  from  the  sample.   Since  typical  values  for  EPS  numbers 
are  in  the  range  of  $1.00  to  $5.00  per  share,  errors  of 
sufficient  magnitude  to  be  deleted  are  rare,  and  suggest  a  data 
entry  or  transcription  error. 

3.3   Stock  Returns 


I  measure  the  new  information  impounded  in  stock  returns  by 
the  cumulated  prediction  errors  from  a  market  model  in 
logarithmic  form: 


E[ln( 1  +  R  .  ) ]  =  a  .    +  B  .  1 n (  1  +  R„  ) 
js       j     j         Ms 


(4) 


where  R,s  is  the  return  to  security  j  on  day  s,  R  m  s  is  the  return 
on  the  CRSP  equally-weighted  market  portfolio  of  securities  on 
day  s,  and  In  denotes  the  natural  logarithm  transformation.^ 

The  parameters  of  (4)  are  estimated  for  each  firm  in  the 
study  using  200  trading  days  of  data  at  a  time,  beginning  in  July 
1974.   Estimated  parameters  are  used  to  predict  ahead  100  trading 
days,  and  excess  returns  are  the  difference  between  the 
realization  ln(l  +  Rjs)  and  the  prediction  based  on  (4).   After 
each  iteration  of  estimation  and  prediction,  the  estimation 
period  is  rolled  forward  by  100  trading  days,  and  new  parameters 
are  est  imated . 

The  estimation  procedure  produces  a  stream  of  predicted 
daily  excess  returns,  £  j  s  .   The  e  j  s  are  cumulated  over  each 
forecast  horizon,  from  the  horizon  date  through  the  announcement 
date  of  annual  EPS,  to  form  UjtT,  the  measure  of  new  information 
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arriving  over  horizon  T  in  year  t  for  firm  j. 

Excess  returns,  not  raw  stock  returns,  are  used  since  excess 
returns  represent  unanticipated  returns.   Inf orma t i ona  1 1 y 
efficient  forecasts  will  have  forecast  errors  comprising  only 
unanticipated  information.   Forecasts  which  are  not 
i n forma t i ona 1 1 y  efficient  will  have  forecast  errors  consisting  of 
both  unanticipated  information  and  information  which  was 
available  at  the  forecast  date  but  was  not  incorporated  into  the 
forecast.   Forecast  errors  from  inefficient  forecasts  may 
therefore  be  correlated  with  the  anticipated  component  of  stock 
returns.   This  correlation  could  cause  a  stronger  association 
with  raw  returns  to  be  observed  for  inefficient  forecasts  than 
for  efficient  forecasts.   This  argues  against  using  the 
association  between  raw  returns  and  forecast  errors  as  a 
criterion  for  evaluating  forecasts,  and  in  favor  of  using  only 
the  unanticipated  portion  of  returns.   This  discussion  is 
continued  in  more  detail  below,  where  test  results  are  reported. 


Results 


4.1   Aggregating  Forecast  Errors 


If  a  forecast  incorporates  all  the  information  available  on 
the  forecast  date  in  an  unbiased  manner,  then  it  can  be  described 
as  an  expectation  in  the  usual  statistical  sense  of  the  word. 
Let  f   denote  such  a  forecast: 
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f  .  .     =  E[A  ...  I  <(>  ^   ] 


(5) 


where  «J>tT  represents  the  information  available  at  a  horizon  t 
prior  to  the  realization,  and  E[  |  ]  is  the  conditional 
expectation  operator. 

In  the  time  between  the  forecast  date  and  the  realization 
date,  new  information  may  arrive.   Even  a  forecast  like  (5), 
which  may  be  ideal  in  the  sense  of  employing  all  information 
available  on  the  forecast  date,  will  omit  unanticipated 
information  which  arrives  later.   Forecast  errors  consist,  in 
part  or  entirely,  of  new  information  revealed  over  the  forecast 
horizon,  i.e.  between  forecast  and  realization. 

Two  closely-related  implications  of  unanticipated 
information  reflected  in  forecast  errors  are  important  for  the 
specification  of  statistical  tests.   First,  contemporaneous 
forecast  errors  will  not,  in  general,  be  cross -sec t i ona 1 1 y 
independent  observations.   Second,  forecast  errors  within  a  year 
which  are  aggregated  cross-sectionally  may  appear  to  be  "biased" 
because  of  the  common  new  information  reflected  in  them. 

An  example  of  information  which  may  be  reflected  in  forecast 
errors  is  an  unanticipated  mac roeconom i c  shock,  which  may  affect 
many  firms  in  a  similar  manner.   This  can  induce  correlation  in 
forecast  errors  across  firms,  for  a  given  year  and  horizon,  and 
across  horizons  within  a  year.   Moreover,  if  the  effect  of  the 
mac roeconomi c  shock  on  firms  has  a  non-zero  mean,  then  a  cross- 
sectional  aggregation  of  forecast  errors,  even  from  unbiased 
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forecasts,  will  also  have  a  non-zero  mean.   This  non-zero  mean  is 
not  bias,  but  rather  is  time-period-specific  new  information. 

If  forecast  errors  are  positively  correlated  across  firms 
within  years,  then  statistical  comparisons  based  on  from  pooled 
time-series  and  cross -sec t i on  data  which  assume  that  observations 
are  cross-sectionally  independent  will  overstate  the  statistical 
validity  of  the  results.  Several  studies  (Brown  and  Rozeff 
(1978),  Elton,  Gruber  and  Gultekin  (1981),  Malkiel  and  Cragg 
(1980),  for  example)  have  compared  forecasts  using  criteria  such 
as  the  number  or  proportion  of  times  that  one  forecasting  method 
outpredicts  another.   This  criterion,  or  any  other  that  assumes 
independent  observations  and  is  applied  to  a  c r os s -s ec t i on ,  could 
obtain  the  appearance  of  statistically  significant  superiority  in 
forecasting  ability  from  an  anecdotal  difference. 

The  tests  developed  in  this  paper  adjust  for  time-period- 
specific  shocks  using  a  simple  fixed  effects  model.   This  model, 
and  the  importance  of  the  adjustment  for  the  results,  are 
descr i  bed  below . 

4.2   Evaluating  Forecasts  -  Bias 

A  simple  model  of  time-period  effects  in  forecast  errors 
is:? 


e  .    =  u    +  n  . 
jtr    Mtx     jtx 


(6) 


where  Utx  Is  the  average  forecast  error  for  year  t  and  horizon  x 
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and  ^jtt  i  s  a  random  error  term,  representing  the  deviation  of 
firm  j's  forecast  error  from  the  common  annual  mean. 

There  may  also  be  firm-specific  information  effects  which 
persist  through  time,  but  the  unanticipated  information  argument 
does  not  apply. 8   If  a  forecast  fully  impounds  information  from 
previous  mistakes  in  an  unbiased  manner,  then  systematically 
recurring  events  will  not  remain  unanticipated  year  after  year. 
Thus  recurrent  firm-specific  forecast  errors  are  not  expected  to 
arise  on  the  basis  of  information  that  was  unavailable  at  the 
time  the  forecast  was  made.9 

I  estimate  (6)  using  least  squares  with  a  dummy  variable  for 
each  year.   The  test  for  bias  is  based  on  the  grand  mean  of  the 
estimated  annual  averages: 


T   IP,, 


(7) 


where  T  is  the  number  of  years  in  the  sample,  and  the  u^x  are  tne 
year-specific  average  forecast  errors,  estimated  separately  for 
each  horizon  x.   The  average  uT  defined  in  (7)  is  a  linear 
combination  of  least-squares  coefficients,  with  estimated 
standard  error: 


n 


- 1    1/2 
[l ' (X'X)   \]  ' 


(8) 


where  \  i s  a  vector  of  ones  of  length  T,  X  is  the  matrix  of  dummy 
variables,  and  sn  is  the  regression  residual  standard  error.  The 
standard  error  (8)  is  a  weighted  version  of  the  residual  standard 
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error  s^,  constructed  from  the  ^jtx-   Tne  residuals  Hj^t  are 
deviations  from  the  annual  averages,  and  so  are  purged  of  time- 
period-specific  information  which  induces  cross-sectional 
correlation  in  the  e  j  t  T  . 

In  Table  2,  the  bias  results  are  presented.   The  reported 
numbers  are  the  forecast  bias,  computed  as  in  equation  (7).    The 
ratio  of  (7)  to  (8)  is  evaluated  as  a  t-statistic,  against  a  null 
hypothesis  of  no  bias,  for  each  analyst  composite,  mean,  median 
and  most  current,  and  for  the  quarterly  models. 

Generally,  forecast  errors  exhibit  statistically  significant 
negative  bias.   Of  the  three  analyst  consensus  measures,  the 
median  uniformly  exhibits  the  smallest  bias,  usually 
indistinguishable  from  zero. 

Negative  bias  corresponds  to  overestimates  of  EPS.   Negative 
bias  in  analysts'  forecasts  is  consistent  with  some  conventional 
wisdom,  which  says  that  analysts  prefer  to  make  optimistic 
predictions  and  "buy"  recommendations,  to  maintain  good  relations 
with  management . 1 °   The  evidence  supporting  this  story  is  weak, 
however,  in  two  respects.   First,  the  median  analyst  forecast 
appears  to  be  unbiased.   Second,  and  more  importantly,  when  the 
analyst  estimates  are  significantly  negative,  they  are 
statistically  indistinguishable  from  those  of  mechanical  time- 
series  models.   The  motive  of  maintaining  good  relations  with 
management  cannot  be  ascribed  to  these  models.   Thus,  support  for 
the  contention  that  analysts  preferentially  issue  optimistic 
forecasts  is  at  best  weak. 
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An  alternative  explanation  which  is  also  consistent  with 
these  results  is  that  analysts  issue  unbiased  forecasts,  but  this 
seven-year  period,  1975  through  1981,  is  one  with  primarily 
negative  unanticipated  EPS.   Unfortunately,  the  most  obvious  way 
to  distinguish  between  the  hypothesis  of  deliberate  optimistic 
bias  and  this  alternative  is  to  collect  data  for  a  longer  span  of 
years.   This  is  not  possible  using  the  I/B/E/S  dataset. 

4.3   Evaluating  Forecasts  -  Accuracy 

I  use  an  approach  similar  to  the  bias  evaluation  described 
in  the  previous  section  for  evaluating  relative  forecast 
accuracy.   Accuracy  is  defined  as  absolute  or  squared  forecast 
error.   For  the  absolute  error  case,  the  model  is: 


JtT 


6     +  6     +  t 
1JT      2tT      jtT 


(9) 


For  the  squared  error  case,  a  similar  model  is  estimated,  with 
(ejtT)2  as  the  left-hand-side  variable.   In  equation  (9)  the 
*ljT  measure  average  accuracy  for  each  firm  j,  and  the  62tT 
measure  average  accuracy  for  each  year  t.   The  Ejtx  are 
deviations  from  the  average  accuracy  in  this  sample  for  firm  j 
and  for  year  t.   Differences  in  accuracy  across  firms  could 
arise,  for  example,  if  there  are  persistent  differences  in  the 
amount  of  information  available  for  different  firms.   Differences 
in  accuracy  across  years  could  arise  if  there  are  more,  or 
larger,  unanticipated  events  in  some  years  than  in  others. 
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Equation  (9)  is  estimated  using  least  squares  on  a  set  of 
dummy  variables  for  firms  and  years.   Average  accuracy  is 
computed  as  a  linear  combination  of  the  estimated  effects:1! 


7   £ 


1JT 


t  =  l 


2JT 


(10) 


Equation  (10)  defines  the  average  absolute  error  accuracy. 
Average  squared  error  accuracy  is  defined  similarly  using  the 
coefficients  from  the  squared  error  version  of  (9).   The 
estimated  standard  error  of  the  average  accuracy  in  equation  (10) 


l  s  : 


S6 


- 1   1/2 
[w* (Z'Z)   y]  ' 


(11) 


In  (11),  Z  represents  the  matrix  of  dummy  variables  used  to 
estimate  equation  (9)  or  its  squared  error  analogue,  u>  is  the 
vector  of  weights  that  transform  the  estimated  parameters  into 
the  average  defined  in  (10),  and  s^  is  the  residual  standard 
error  from  the  regression  equation  (9). 

The  estimates  in  equations  (9)  through  (11)  are  computed  for 
the  mean,  median  and  most  current  analyst  forecasts,  and  for  the 
forecasts  from  the  quarterly  models.   Pairwise  differences  in 
accuracy  are  compared  across  forecast  sources  using  a  t-statistic 
constructed  from  the  average  accuracies  from  (10)  or  its  squared 
error  analogue,  and  the  standard  error  from  (11). 

Tables  3  through  5  summarize  the  results  on  forecast 
accuracy.   The  average  absolute  errors  and  average  squared 
errors,  computed  as  described  in  equation  (10)  for  each  forecast 
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source,  appear  in  Table  3.   Table  4  contains  t-statistics  testing 
pairwise  differences  in  accuracy  among  analysts.   Table  5 
contains  t-statistics  testing  pairwise  differences  in  accuracy 
between  analysts  and  the  quarterly  models. 

Table  3  displays  a  pattern  of  increasing  forecast  accuracy 
as  the  earnings  announcement  date  approaches,  for  all  forecast 
sources.   Both  average  absolute  error  and  average  squared  error 
decline  uniformly  as  the  year  progresses,  for  analysts  and  for 
quarterly  models.   For  example,  the  average  absolute  error  of  the 
most  current  forecaster  declines  from  $0,742  per  share  at  a 
horizon  of  240  days  (almost  a  full  year  prior  to  the 
announcement)  to  $0,292  per  share  at  a  horizon  on  5  days.   This 
pattern  of  convergence  toward  the  announced  EPS  number  is 
consistent  with  forecasts  incorporating  some  new  information 
relevant  to  the  prediction  of  EPS  over  the  course  of  the  year. 

From  Table  3  it  appears  that  the  most  current  analyst  is  no 
worse  than  the  other  sources,  and  that  analysts  dominate 
quarterly  models  in  the  longer  horizons.   The  reader  of  Table  3 
should  avoid  using  the  heuristic  of  counting  the  number  of  times 
that  the  most  current  forecaster  is  most  accurate  to  assess  these 
differences.   The  relative  performance  results  are  highly 
correlated,  both  across  horizons  and  across  definitions  of 
accuracy.   The  results  of  a  statistical  test  for  the  differences 
in  accuracy  which  are  suggested  by  a  perusal  of  Table  3  appear  in 
Tables  4  and  5. 

In  Tables  4  and  5,  a  positive  difference  means  that  the 
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first  of  the  pair  is  less  accurate.   Table  4  contains  the  results 
of  pairwise  comparisons  among  the  three  analyst  consensus 
definitions.   In  terms  of  absolute  error,  which  is  reported  in 
Panel  A,  when  the  differences  are  significant  they  favor  the  mean 
over  the  median  and  the  most  current  forecaster  over  either  the 
mean  or  the  median.   For  example,  at  the  60-trading-day  horizon. 
the  t-statistic  on  the  difference  between  the  mean  and  median 
forecasts  in  average  absolute  accuracy  is  -2.14,  which  favors  the 
mean,  and  is  significant  at  the  .05  level.   The  t-statistic  on 
the  difference  between  the  median  and  the  most  current  forecasts 
for  the  same  horizon  is  4.97,  which  favors  the  most  current,  and 
is  significant  at  the  .01  level. 

In  contrast,  in  Panel  B,  where  differences  in  squared  error 
accuracy  are  presented,  there  are  no  measurable  differences  in 
accuracy  among  the  three  analyst  consensus  definitions.   The 
signs  of  differences  in  this  panel  are  generally  the  same  as 
those  in  Panel  A,  and  so  the  squared  error  evidence  does  not 
contradict  the  absolute  error  results.   However,  the  very  small 
t-statistics  lend  no  additional  support  to  the  conclusions, 
either. 

The  results  reported  in  Table  5  indicate  that  analyst 
forecasts  generally  dominate  the  time-series  models  at  the  longer 
horizons.   For  the  240,  180  and  1 20- t rad i ng-day  horizons, 
wherever  differences  are  statistically  significant,  the  results 
favor  analysts  over  the  quarterly  models.   This  evidence  is 
consistent  with  the  explanation  that  analysts  use  a  broader 
information  set  than  can  be  exploited  by  a  univariate  model. 


23 


At  the  60-trading-day  horizon,  however,  the  quarterly  time- 
series  models  dominate  the  mean  and  the  median  analyst  forecast 
in  all  comparisons  where  significant  differences  exist.   The  most 
current  forecast  is  never  dominated  to  a  statistically 
significant  extent  by  the  quarterly  models,  but  generally  is 
indistinguishable  from  them. 

In  summary,  the  accuracy  results  reported  in  Tables  3 
through  5  generally  support  the  conjectures  that  analysts,  with 
their  broader  information  set,  are  at  least  as  accurate  as  time- 
series  models,  and  that  the  most  current  analyst  forecast  is  at 
least  as  accurate  as  the  mean  or  the  median.   Although  the 
statistical  significance  of  results  varies  somewhat  across 
forecasting  horizons  and  accuracy  criteria,  wherever  differences 
in  accuracy  between  forecast  sources  are  statistically 
significant,  the  results  conform  with  expectations. 

The  results  reported  here  probably  understate  the  difference 
between  the  most  current  forecast  and  the  mean  and  median 
definitions  which  appear  in  most  other  published  work.   My  sample 
is  selected  to  eliminate  the  "publication  lag"  which  is 
characteristic  of  datasets  which  use  publication  dates,  rather 
than  analysts'  dates,  to  select  forecasts.   Further,  the  results 
of  comparisons  between  analysts  and  quarterly  models  may 
understate  differences  because  of  the  sample  selection  process. 
Since  the  sample  of  firms  is  weighted  toward  stabler  and  longer- 
lived  firms  by  the  data  requirements  of  the  time-series  models, 
the  selection  process  may  exclude  firms  where  analysts' 
information  advantage  is  largest. 
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4.4   Evaluating  Forecasts  -  Market  Association 


The  criteria  developed  in  the  previous  two  sections,  bias 
and  relative  accuracy,  are  common  in  the  literature  on  forecast 
evaluation.   They  do  not,  however,  address  the  context  in  which 
the  forecasts  are  used.   Both  researchers'  and  investors'  use  of 
forecast  data  in  contexts  related  to  securities  markets  suggests 
that  association  with  stock  returns  may  provide  a  relevant 
empirical  comparison. 

If  forecast  errors  reflect  information  relevant  to  the 
firm's  prospects  arriving  after  the  forecast  date,  then,  subject 
to  two  important  qualifications  discussed  below,  this  implies 
that  forecast  errors  will  be  positively  correlated  with  new 
information  impounded  in  stock  returns  over  the  forecast  horizon. 
The  first  qualification  to  this  implied  association  is  that  the 
information  relevant  to  valuing  the  firm's  common  stock  is  not 
precisely  the  same  as  the  information  relevant  to  current-year 
earnings.   There  are  errors  in  both  variables  with  respect  to  the 
measured  association  between  them  which  represents  the  common 
information.   Non-recurring  events,  whether  they  are  treated  as 
extraordinary  items  or  not,  may  affect  earnings  in  a  particular 
year,  but  may  be  inconsequential  to  the  long-term  value  of  the 
firm.   Conversely,  events  that  influence  longer-term  prospects, 
such  as  changes  in  investment  opportunities,  may  affect  the  value 
of  the  firm  without  altering  current  earnings. 
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The  second  qualification  is  that  excess  returns  are 
constructed  to  exclude  one  source  of  unanticipated  information. 
The  excess  return  is  purged  of  its  systematic  relation  with 
market  returns,  which  includes  both  anticipated  and  unanticipated 
market  returns.   Since  informational  ly  inefficient  forecasts  will 
be  correlated  with  anticipated  information,  it  is  desirable  to 
purge  the  stock  returns  of  the  anticipated  component  of  the 
market  return.   Elimination  of  the  unanticipated  market  return, 
however,  may  reduce  the  measurable  association  between  excess 
returns  and  forecast  errors.   On  the  other  hand,  it  is  important 
to  note  that  while  the  power  of  the  tests  for  positive 
association  is  reduced,  the  reduction  in  power  does  not  vary 
across  forecast  sources,  since  they  are  all  evaluated  relative  to 
the  same  excess  returns.   In  other  words,  the  relative  degree  of 
association  across  sources  will  be  unaffected. 

Both  qualifications  noted  above  will  have  the  effect  of 
reducing  the  measurable  association  between  forecast  errors  and 
excess  returns.   Nevertheless,  previous  studies  facing  the  same 
inherent  difficulties  have  found  statistically  significant 
positive  associations  between  unexpected  earnings  and  excess 
stock  returns  (Ball  and  Brown  (1968),  Beaver,  Clarke  and  Wright 
(1979),  and  Fried  and  Givoly  (1982),  for  example). 

The  regression  model  used  to  estimate  the  association 
between  cumulated  excess  returns,  represented  by  UjtT,  and 
forecast  errors,  eijtT*  is: 


e      =ot      +  a  +  B    U     +v.. 

ijtx     lijx     2itx     IT   jtx     ijtx 


(12) 
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In  (12),  Bix^jtx  *  s  the  portion  of  the  forecast  error  from  source 
i  at  horizon  X   which  is  systematically  related  to  excess 
returns.   The  slope  coefficient  8jT  is  the  covariance  between 
excess  returns  and  forecast  errors,  adjusted  for  firm  and  year 
effects,  in  units  of  the  variance  of  excess  returns.   Using 
excess  returns  as  the  independent  variable  and  forecast  errors  as 
dependent  has  the  desirable  feature  that  B^T  and  its  associated 
t-statistic  have  the  same  scale  for  all  sources  i.   If  the  roles 
of  these  two  variables  were  reversed  in  the  regression  equation, 
the  estimated  regression  slope  coefficient  would  depend 
explicitly  on  the  forecast  error  variance  from  source  i. 

The  constants  a  j  ^  j  T  and  ot2itx  measure,  respectively,  firm- 
and  year- spec i f i c  average  forecast  errors,  conditional  on  the 
systematic  relation  with  excess  returns.   The  0t2jtX'  the  year 
effects,  play  an  important  role  in  equation  (12),  since  they 
capture  the  time-period-specific  information  in  forecast  errors 
which  is  not  captured  by  excess  returns.  12   Among  other  things, 
they  include  the  average  effect  of  omitting  the  unanticipated 
component  of  the  market  return.   If  the  a2itx  are  not  included  in 
the  model,  then  they  are  impounded  in  the  regression  residuals  as 
an  omitted  variable.   This  induces  cross-sectional  correlation  in 
the  residuals,  which  if  ignored  leads  to  incorrect  statistical 
inferences,  as  was  discussed  above  for  the  bias  computation. ^ 

Equation  (12)  is  estimated  by  stacking  the  regressions  for 
the  five  forecast  sources,  and  estimating  them  jointly.   The 
forecast  sources  are  the  mean,  median  and  most  current  analyst 
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forecast  and  the  two  quarterly  models.   Estimations  are  performed 
jointly  for  the  five  forecast  sources,  and  separately  for  each 
forecast  horizon. 

Since  the  matrix  of  independent  variables,  which  consists  of 
cumulated  excess  returns  over  the  forecast  horizon  and  dummy 
variables  indicating  firms  and  years,  is  the  same  for  each  of  the 
forecast  sources,  there  is  no  efficiency  gain  over  equation-by- 
equation  least  squares  (see  Zellner  (1962)).   The  advantage  of 
stacking  the  equations  is  for  joint  estimation  of  the  firm  and 
year  effects,  so  that  observations  from  each  of  the  five  forecast 
sources  are  adjusted  for  the  same  firm  and  year  effects. 

Tables  6  through  8  contain  the  regression  results  from 
estimation  of  model  ( 1 2 ) . 1 4   In  Table  6,  I  report  the  estimated 
slope  coefficients  and  their  associated  t -s ta t i s t i cs  ,  testing  the 
statistical  significance  of  the  relation  between  forecast  errors 
and  excess  returns  over  the  forecasting  horizon.   Table  7 
contains  t-statistics  which  test  for  differences  in  the  slope 
coefficients  across  forecast  sources.   Table  8  contains 
regression  summary  statistics,  including  adjusted  R2,  F  - 
statistics,  and  numbers  of  parameters,  and  sample  sizes. 

According  to  results  reported  in  Table  8,  equation  (12) 
explains  between  9%  and  16%  of  the  variation  in  forecast  errors, 
with  slight  variations  across  horizons.   The  largest  adjusted  R2 
appears  at  the  5-trading-day  horizon,  though  differences  in 
explanatory  power  are  not  large.   The  model  has  statistically 
significant  explanatory  power,  according  to  the  regression  F- 
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statistics,  which  reject  the  null  of  no  explanatory  power  at  the 
. 001  level  . 

The  incremental  F-statistics  in  Table  8  confirm  that  the 
year  -  spec i f i c  effects  are  important  in  equation  (12).  The  F- 
statistic  on  the  year-specific  effects  tests  the  null  hypothesis 


H0  :  a2ilT    a2i2x 


a2iTT  =  ° 


That  is,  the  F-statistic  tests  the  null  hypothesis  that 
estimation  of  year-specific  intercepts  adds  no  explanatory  power 
to  the  model.   This  hypothesis  is  rejected  at  the  .05  level  at 
all  horizons,  and  at  the  .001  level  or  better  at  the  horizons 
longer  than  5  days.   The  importance  of  year  effects  in  the  model 
increases  with  the  length  of  the  horizon.   This  is  consistent 
with  the  information-based  explanation  for  their  inclusion  in  the 
model,  namely  that  forecast  errors  impound  t ime-per i od -  spec i f i c 
unanticipated  information.   Over  longer  horizons,  loosely 
speaking,  the  "quantity"  of  unanticipated  information  is  greater. 
The  strength  of  this  result  also  confirms  the  assertions  made 
earlier  that  a  cr os s -sec t ion  of  forecast  errors  for  a  single  time 
period  is  not  a  set  of  independent  observations. 

The  F-statistics  on  firm-specific  effects  reported  in  Table 
8  also  reject  the  null  hypothesis,  which  is: 


0     1 1  IT 


a 


1  i2i 


ot  .  .   =  0 
liJT 


There  are  measurable  firm-specific  differences  in  average 
forecast  error  at  all  horizons,  even  after  the  adjustment  for 
firm-specific  information  impounded  in  excess  returns.   The 
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strength  of  this  result  varies  little  across  forecast  horizons 
and  across  transformations  of  the  dependent  variable. 

The  importance  of  the  slope  coefficients  in  model  (12), 
indicated  by  the  F-statisic  reported  in  Table  8,  varies  across 
forecast  horizons  and  across  transformations  of  the  dependent 
variable.   The  individual  slope  coefficients  reported  in  Table  6, 
however,  are  of  greater  relevance.   Generally,  the  results  in 
Table  6  show  a  pattern  of  positive  association  between  forecast 
errors  and  excess  returns.   A  positive  association  is  expected 
if,  first,  there  is  some  overlap  between  information  relevant  to 
firm  value  and  information  relevant  to  current-year  earnings,  and 
second,  some  of  this  overlapping  information  is  unanticipated 
both  by  investors  and  by  the  predictor  of  EPS. 

The  statistical  significance  of  the  positive  association 
varies  somewhat  across  forecast  horizons,  and  more  importantly 
across  forecast  sources.   The  strongest  results  are  for  the  120- 
day  horizon.   Among  the  analyst  consensus  forecasts,  the 
strongest  results  are  generally  for  the  most  current  forecaster, 
which  is  consistent  with  the  most  current  forecaster  acting  as  a 
reasonable  composite  of  analysts'  information.   The  strongest 
results,  and  the  only  ones  which  are  consistently  positive  and 
statistically  significant,  however,  are  for  the  quarterly 
autoregress i ve  model,  equation  (1).   This  pattern  of  relative 
performance  does  not  vary  substantially  across  horizons  or  across 
transformations  of  the  dependent  variable.   This  result  is 
anomalous,  especially  in  light  of  the  quarterly  model's  relative 
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inaccuracy.   It  indicates  that  prior  knowledge  of  the  forecast 
error  from  a  quarterly  au to r egr es s i ve  model  is  a  better  predictor 
of  excess  returns  than  prior  knowledge  of  the  forecast  error  from 
analysts'  forecasts. 

The  greater  association  of  excess  returns  with  forecast 
errors  from  a  time-series  model  than  with  those  from  analysts  is 
not  consistent  with  the  results  of  Fried  and  Givoly  (1982).   I 
use  quarterly  data  in  the  time  series  models  of  annual  earnings, 
while  Fried  and  Givoly  (1982)  use  annual  data.   Since  models 
using  quarterly  data  produce  more  accurate  forecasts  of  annual 
earnings  than  models  using  only  annual  data  (see  Hopwood,  McKeown 
and  Newbold  (1982)),  presumably  my  tests  are  more  demanding  of 
analysts  than  those  used  by  Fried  and  Givoly.   However,  the 
result  remains  anomalous  since  analysts  are  more  accurate  and  can 
employ  more  information  than  quarterly  time  series  models. 

This  anomaly  is  further  investigated  in  Table  7.  by  testing 
for  the  statistical  significance  of  differences  in  the  slope 
coefficients.   A  further  advantage  of  stacking  equations  across 
forecast  sources  to  estimate  them  is  that  statistical  testing  is 
simplified,  since  a  set  of  linear  constraints  on  the  estimated 
slope  coefficients  BjT  generates  direct  tests  of  differences  in 
slope  across  forecast  sources.   For  example,  if  6T  is  the  vector 
of  five  slope  coefficients,  one  for  each  forecast  source,  and  if 
C12'  is  the  vector  (1,-1,0,0,0),  then  c_i2'£t  estimates  the 
difference  in  slopes  between  the  first  and  second  sources,  with 
estimated  standard  error: 
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c'  6 


-  1    1/2 
[C'  (X'Xlg'c]1^ 


(13) 


I  n  (13),  sv  ^  s  ^  n  e  residual  standard  error  from  the  joint 
estimation  of  equation  (12).    (X'X)-lg  is  tno  lower-right 
submatrix  of  five  rows  and  five  columns  from  the  (X'X)-l  matrix 
of  equation  (12).   This  submatrix  determines  the  variance- 
covariance  relations  among  the  five  slope  coefficients. 

Linear  constraints  of  the  form  of  c  j  2  are  used  to  evaluate 
differences  in  the  slope  coefficients  that  appear  in  Table  6. 
i.e.  differences  across  forecast  sources  in  the  association 
between  excess  returns  and  forecast  errors.   Table  7  contains  the 
results  of  these  statistical  tests.   Results  are  shown  for  tests 
of  pairwise  differences  between  the  quarterly  a u tor egr es s i ve 
model  and  all  other  forecast  sources,  and  for  differences  between 
the  most  current  analyst  and  the  mean  and  median  analyst 
forecasts.   The  statistical  tests  confirm  the  anomalous  result, 
that  errors  from  a  mechanical  quarterly  model  often  are 
significantly  more  closely  related  to  excess  returns  than  errors 
from  analysts.   In  addition,  the  tests  indicate  that  while  the 
most  current  forecast  typically  shows  the  strongest  result  among 
analyst  consensus  forecasts,  the  difference  is  not  statistically 
significant,  in  general. 


5J Summary_and_Conclusions 


Analysts'  predictions  of  EPS  are  a  potential  source  of 
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"market  expectations"  information.   I  have  examined  properties  of 
different  composites,  or  consensus  forecasts.   Results  reported 
here  indicate  that  the  most  current  forecast  from  a  set  of 
analysts'  forecasts  is  a  reasonable  aggregation  of  the 
information  in  the  set. 

Five  alternate  sources  of  forecasts  are  examined:   the  mean, 
median,  and  most  current  from  a  set  of  analysts'  forecasts,  an 
autoregressive  model  in  fourth  differences  of  the  univariate 
series  of  quarterly  EPS,  and  a  f ou r t h -d i f f erenced  random  walk 
using  quarterly  EPS.   The  two  quarterly  time-series  models  are 
included  primarily  as  benchmarks. 

The  most  current  forecast  is  at  least  as  accurate  as  either 
the  mean  or  median  forecast,  and  generally  dominates  them  in 
absolute  error  terms.   This  result  indicates  that  the  date  of  the 
forecast  is  relevant  for  determining  its  accuracy,  and  dominates 
"diversification"  obtained  by  aggregating  forecasts  from 
different  sources.   Since  most  published  aggregations  of 
forecasts  and  most  previous  research  treat  all  forecasts  as  if 
they  are  equally  current,  they  ignore  this  relevant  piece  of 
information.   In  this  sample  the  forecast  error  from  the  most 
current  forecast  is  more  closely  associated  with  excess  returns 
over  the  forecast  horizon  than  the  error  from  the  mean  or  the 
median  forecast,  but  the  difference  in  association  is  not,  in 
general,  statistically  significant. 

Analysts  generally  are  significantly  more  accurate  than 
time-series  models.   Errors  from  the  quarterly  autoregressive 
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model,  however,  appear  to  be  more  closely  related  with  excess 
returns  over  the  forecasting  horizon  than  those  of  analysts, 
however.   Because  of  this  anomalous  result,  it  is  unclear  that 
analysts  provide  a  better  model  of  the  "market  expectation"  than 
mechanical  models. 

It  should  be  noted,  though,  that  the  sample  of  firms  was 
reduced  sharply  by  the  data  requirements  of  the  time-series 
models.   This  sample,  with  its  selection  bias  toward  longer-lived 
firms  with  continuous  data  available,  does  not  clearly  isolate 
cases  where  analysts  might  be  expected  to  have  the  most  advantage 
over  mechanical  models,  and  perhaps  eliminated  many  of  these 
cases.   These  firms,  where  there  is  a  substantial  amount  of  non- 
earnings  information  expected  to  have  an  impact  on  earnings,  may 
be  a  fruitful  area  for  future  investigation. 
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FOOTNOTES 


1 A  n  example  of  the  intuitive  agument  that  diversification  across 
forecast  sources  improves  accuracy  can  be  found  in  Beaver  (1981) 
which  discusses  forecasts  of  sports  scores. 

2Brown ,  Foster  and  Noreen  (1984)  analyze  a  related,  but 
different,  dataset,  taken  from  the  set  of  summary  statistics 
which  I/B/E/S  produces  for  clients. 

3 "Earnings"  and  "earnings  per  share"  (or  "EPS")  are  sometimes 
used  interchangeably  in  this  paper.   The  data  consist  of 
forecasts  of  EPS,  and  I  wish  to  abstract  from  the  effects  of 
changes  in  capitalization  on  the  prediction  of  earnings.   When  a 
stock  split  or  stock  dividend  is  announced  between  a  forecast 
date  and  a  realization,  I  adjust  the  forecast  by  the  actual 
change  in  capitalization. 


4  T  h  e  results  reported  in  this  paper  use  the  raw  (unsealed) 
forecast  errors  as  defined  in  equation  (3).   Other  forecast  error 
metrics,  or  scales,  may  be  appropriate,  for  example  to  control 
for  heteroscedasticity.   The  substantive  conclusions  of  the  paper 
are  not  altered,  however,  when  other  scales  are  used  to  measure 
forecast  errors.   In  Appendix  A,  results  are  reported  for  two 
alternative  scales:   standardized  forecast  errors,  and  percent 
forecast  errors.   Standardized  forecast  errors  are  forecast 
errors  divided  by  the  average,  over  the  previous  five  years,  of 
absolute  changes  in  EPS.   Percent  forecast  errors  are  forecast 
errors,  divided  by  the  absolute  value  of  the  prior  year's  EPS. 
Other  scales  which  I  have  investigated,  and  for  which  the 
qualitative  results  of  this  paper  are  replicated,  are:   a  version 
of  the  standardized  forecast  error  where  the  denominator  is  the 
standard  deviation  of  EPS  changes,  and  versions  of  the  the 
percent  forecast  error  where  the  sample  is  censored  to  exclude 
negative  denominators,  or  denominators  less  than  $0.20. 

^Firms  announce  earnings  with  remarkable  consistency  year  after 
year,  so  choosing  fixed  lengths  of  time  prior  to  the  announcement 
date  is  a  fairly  accurate  means  of  finding  dates  that  differ  by 
one  quarter's  earnings.   Of  the  6218  horizons  included  in  this 
paper,  13  dates  did  not  fall  between  the  quarterly  announcements 
as  intended.   The  results  are  not  affected  by  deletion  of  these 
observa t  ions. 

"Results  did  not  differ  from  those  reported  here  when  the  value- 
weighted  market  portfolio  of  securities  was  used  as  a  proxy  for 
the  market. 

'The  subscript  i.  which  indexes  the  source  of  the  forecast,  is 
suppressed  in  the  following  discussion  for  readability. 

8The  estimation  was  also  done  with  firm-specific  effects  in  the 
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model.   The  bias  results  do  not  differ  qualitatively  from  those 
reported  here. 

9  T  h  i  s  discussion  does  not  argue  against  the  existence  of  firm- 
specific  effects  which  persist  through  time.   It  argues  against 
an  explanation  for  such  effects  based  on  unanticipated 
information . 

10For  example,  see  Dirks  and  Gross,  The  Great  Wall  Street 
Scandal,  especially  pp.  252-257.   Also  see  "Bank  Analysts  Try  to 
Balance  their  Ratings",  WSJ,  May  29,  1984.  p.  33;  and   "Picking  a 
Loser",  WSJ,  September  28,  1983.  p.  1 . 


l^The  estimated  effects  are  not  the  same  as  the  est 
coefficients,  because  model  (9)  has  two  sets  of  eff 
and  years.  A  simple  example  will  illustrate  this, 
two  years  and  three  firms  in  the  sample,  then  model 
estimated  with  no  intercept,  using  two  year  dummy  v 
and  DY2)  and  two  firm  dummy  variables  (DF1  and  DF2) 
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The  "year  1  effect"  is  the  average  |e-jtTi  for  t  =  l. 
is  not  estimated  by  d2lT.   Rather,  d21x  is  the  aver 
year  1  for  the  omitted  (third)  firm.   The  "year  1  e 
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firm  1  effect"  is  estimated  by: 
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The  "firm  3  effect"  is  estimated  by: 


&13T  =  (1/2) 
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Since  any  non-redundant  spanning  set  of  dummy  variables  can  be 
used,  the  particular  linear  combinations  of  coefficients  to 
estimate  the  firm  and  year  effects  depend  on  the  model  used. 

12Dropping  the  <Xijjt,  the  f  i  rm- spec  i  f  i  c  effects,  does  not  alter 
the  estimates  of  the  slope  coefficients  B  j  T  0r  their  statistical 
significance  by  a  substantial  amount.   Omitting  the  ot  2  j  t  x  . 
however,  alters  both  the  estimates  and  their  statistical 
significance. 

1  3  a  n  alternative  way  to  model  this  problem  is  to  include  the  year 
or  firm  effects  as  "random  effects",  contributing  off-diagonal 
elements  to  the  covariance  matrix  of  the  VjjtT    Mundlak  (1978) 
shows  that  the  estimate  of  the  slope  coefficient  8  j  T  obtained 
from  a  model  like  (12),  is  identical  to  the  GLS  estimate  which 
would  be  obtained  if  the  firm-  and  year-specific  effects,  «iijx 

and  a 2 j t T ,  were  modeled  as  random  effects  and  included  in  the 
covariance  matrix. 
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14An  alternate  method  of  measuring  the  association  between 
forecast  errors  and  excess  returns,  similar  to  that  of  Ball  and 
Brown  (1968),  is  to  construct  portfolios  based  on  foreknowledge 
of  the  sign  of  EPS  forecast  error.   That  is,  a  long  position  is 
taken  in  each  of  the  securities  for  which  the  forecast  error  is 
positive,  and  a  short  position  is  taken  in  those  with  negative 
errors.   This  procedure,  applied  to  these  data,  produces  results 
qualitatively  identical  to  those  reported  here. 
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Table  1 


Selected  Characteristics  of  the  Distribution  of  Forecast  Ages 
for  the  Mean.  Median  and  Most  Current  Analyst  Forecasts, 
for  Five  Forecast  Horizons 


Horizon 

Fracti  les 

Sa  mg  1 
mean 

e  Moments 
stdev  skew 

Source 

.  1 

.  2  5 

.  5 

.  75 

.  9 

N 

mean 

240 

34 

46 

60 

74 

89 

61.5 

24  .  7 

0  .  8 

1  198 

180 

33 

43 

58 

73 

91 

59  .  9 

23  .  9 

0  .  8 

1  235 

120 

34 

47 

62 

82 

102 

65.9 

28  .  1 

1  .  0 

12  54 

60 

38 

51 

67 

88 

1  10 

71.7 

31.8 

1  .  6 

1258 

5 

40 

52 

67 

87 

109 

72  .  3 

31  .  2 

2  .  5 

1260 

median 

240 

7 

20 

51 

83 

127 

59  .  4 

47  .  9 

1  .  1 

1  198 

180 

5 

16 

41 

78 

131 

56  .  6 

54  .  5 

1  .  5 

1234 

120 

7 

19 

47 

80 

135 

61.1 

58  .  0 

1  .  8 

1254 

60 

7 

19 

49 

88 

155 

66.6 

67  .  7 

2  .  0 

1  258 

5 

15 

26 

43 

71 

131 

61.7 

60  .  8 

2  .  6 

1260 

current 

240 

1 

2 

4 

14 

43 

14.4 

24  .  7 

3  .  3 

1198 

180 

1 

1 

3 

9 

24 

9  .  0 

16.9 

4  .  8 

1233 

120 

1 

2 

4 

11 

25 

10  .  0 

18.8 

6  .  3 

1254 

60 

1 

1 

4 

10 

26 

10.6 

22  .  5 

6  .  8 

1259 

5 

1 

5 

11 

20 

34 

15.8 

22  .  0 

8  .  9 

12  59 

Notes : 

The  forecast  sources  are: 

mean  -  the  mean  of  available  analysts'  forecasts, 

median  -  the  median  of  available  analysts'  forecasts, 

current  -  the  most  recent  forecast  from  an  analyst. 

The  age  of  a  forecast  is  the  number  of  trading  days  between  the 
analyst's  forecast  date  and  the  horizon  date  chosen  in  this  study. 
Specifically,  for  the  sources  in  this  table,  age  is  defined: 

mean  -  the  mean  of  the  ages  of  available  analyst  forecasts 
median  -  the  median  of  the  ages  of  available  analyst 
forecasts  . 
current  -  the  age  of  the  most  recent  analyst's  forecast. 


The  forecast  horizons  are  measured  in  trading  days  prior  to  the  annual 
earnings  announcement. 


Table  2 
Forecast  Bias,  for  Five  Forecast  Sources,  at  Five  Forecast  Horizons 


Source 


Forecast  Horizon,  in  Trading  Days 


240 

180 

120 

60 

5 

-0.15 

-0 

1  1 

-0.14 

-0  .  08 

( 

-3.82) 

(-3 

15) 

(-5.18) 

(-4.11) 

-0.09 

-0 

05 

-0.11 

-0  .  06 

( 

-2.17) 

(-1 

49  ) 

(-4.24) 

(-3.25) 

-0  .  08 

-0 

12 

-0.12 

-0  .  09 

-0.05 

( 

-2.09) 

(-3 

57) 

(-4.70) 

(-4.96) 

(-2.74) 

-0  .  03 

-0 

05 

-0  .  06 

-0  .  02 

0  .  02 

( 

-0.72) 

(-1 

52) 

(-2.25) 

(-1.19) 

(  1  .00) 

-0  .  08 

-0 

10 

-0  .  09 

-0  .  04 

-0  .  02 

( 

-1.91) 

(-2 

93  ) 

(-3.27) 

(-2.34) 

(-1.42) 

q  .  a  .  r 


q  .  r  .  w 


mean 


median 


current 


Notes 


The  computation  of  forecast  bias  and  its  associated  t-statistic  (in 
parentheses)  are  described  in  equations  (6)  through  (8)  in  the  text. 

The  forecast  sources  are: 

q.a.r.  -  a  quarterly  autoregressive  model  in  fourth  differences; 

equation  (1)  in  the  text, 
q.r.w.  -  a  random  walk  model  with  drift  in  fourth  differences: 
equation  (2)  in  the  text, 
mean  -  the  mean  of  the  available  analysts'  forecasts, 
median  -  the  median  of  the  available  analysts'  forecasts, 
current  -  the  most  recent  forecast  from  an  analyst. 

The  forecast  horizons  are  measured  in  trading  days  prior  to  the  annual 
earnings  announcement. 

The  degrees  of  freedom  for  all  reported  t-statistics  are  over  1,000, 
so  they  are  approximately  normal.   For  a  two-sided  test,  the  .05  and 
.01  critical  points  of  the  N(0,1)  distribution  are  1.96  and  2.58, 
respectively. 


Table  3 

Forecast  Accuracy:   Average  Absolute  or  Squared  Forecast  Error 
for  Five  Forecast  Sources,  and  Five  Forecast  Horizons 


] 

forecast  Hor 

i  z  on  .  in 

Trading 

Days 

140 

L80 

120 

60 

5 

Source : 

q  .  a  .  r  . 

0 

975 

0 

780 

0 

592 

0 

350 

Average 

q  .  r  .  w  . 

0 

.  963 

0 

.  781 

0 

.  620 

0 

.  363 

Absolute 

mean 

0 

747 

0 

645 

0 

516 

0 

395 

0 

291 

Forecast 

median 

0 

.  788 

0 

.  677 

0 

.  546 

0 

4  35 

0 

321 

Error 

current 

0 

742 

0 

610 

0 

468 

0 

342 

0 

292 

q  .  a  .  r  . 

2 

.  771 

1 

.  877 

1 

.  203 

0 

460 

Ave  rage 

q  .  r  .  w  . 

2 

626 

1 

.897 

1 

255 

0 

447 

Squared 

mean 

1 

.  531 

1 

.  239 

0 

.  818 

0 

478 

0 

351 

Forecast 

median 

1 

595 

1 

246 

0 

.  790 

0 

546 

0 

409 

Error 

current 

1 

.  638 

1 

.  178 

0 

.  669 

0 

389 

0 

361 

Notes : 

The  computation  of  average  absolute  error  is  described  in  equations 
(9)  and  (10)  in  the  text.   The  computation  of  average  squared  error  is 
analogous  to  the  computation  of  average  absolute  error. 

The  forecast  sources  are: 

q.a.r.  -  a  quarterly  autoregressive  model  in  fourth  differences: 

equation  (1)  in  the  text, 
q.r.w.  -  a  random  walk  model  with  drift  in  fourth  differences; 
equation  (2)  in  the  text, 
mean  -  the  mean  of  the  available  analysts'  forecasts, 
median  -  the  median  of  the  available  analysts'  forecasts, 
current  -  the  most  recent  forecast  from  an  analyst. 


The  forecast  horizons  are  measured  in  trading  days  prior  to  the  annual 
earnings  announcement. 


Table  4 

Pair wise  Differences  in  Forecast  Accuracy 
Among  the  Mean,  Median  and  Most  Current  Analyst  Forecasts 

for  Five  Forecast  Horizons 


Panel  A:   t-Statistics  on  Differences  in  Average  Absolute  Error 


mean  -  median 

mean  -  current 

median  -  current 


Forecast  Horizon,  in  Trading  Days 

240        180        120        60  5 


1.06 
0.11 
1.18 


-0  .  98 

1  .  07 

2  .  05 


1.11 

1  .  82 

2  .  93 


■2.14 
2  .  83 

4  .  97 


1.62 

0  .  02 

1  .  60 


Panel  B:   t-Statistics  on  Differences  in  Average  Squared  Error 


Forecast 

H 

orizon . 

i  n 

Tradin 

p 

Da  vs 

240 

180 

120 

60 

5 

mean 

-  median 

-0  .  26 

-0  .  04 

0  .  20 

-0  .  83 

-0  .  59 

mean  - 

current 

-0  .  44 

0  .  30 

1  .  06 

1  .  09 

-0.10 

median  - 

current 

-0  .  18 

0  .  34 

0  .  86 

1  .  91 

0  .  50 

Notes : 

The  reported  numbers  are  t-statistics  on  pairwise  differences  in 
average  absolute  or  squared  forecast  error.   See  Table  3  for  the 
average  absolute  and  squared  errors.   The  computations  are  described 
in  equations  (9)  through  (11)  in  the  text. 

The  degrees  of  freedom  for  all  reported  t-statistics  are  over  2.000, 
so  they  are  approximately  normal.   For  a  two-sided  test,  the  .05  and 
.01  critical  points  from  the  N ( 0  ,  1  )  distribution  are  1.96  and  2.5  8, 
respectively . 


The  forecast  sources  are: 

mean  -  the  mean  of  the  available  analysts'  forecasts, 

median  -  the  median  of  the  available  analysts'  forecasts, 

current  -  the  most  recent  forecast  from  an  analyst. 

The  forecast  horizons  are  measured  in  trading  days  prior  to  the  annual 
earnings  announcement. 


Table  5 

Pair wise  Differences  in  Forecast  Accuracy  Between  Analysts  and 
Quarterly  Time-Series  Models,  for  Five  Forecast  Horizons 

Panel  A:   t-Statistics  on  Differences  in  Average  Absolute  Error 


Quarter:     1 
Horizon:   240 


2 
180 


3 

120 


4 
60 


q  .  a  .  r  .  -  mean 

5 

8  5 

4.15 

2  .  85 

-2  .  36 

q.a.r.  -  median 

4 

79 

3.17 

1  .  74 

-4.51 

q.a.r.  -  current 

5 

96 

5  .  22 

4  .  67 

0  .  46 

q.r.w.  -  mean 

5 

56 

4  .  20 

3  .  92 

-1.70 

q . r . w .  -  med  ian 

4 

50 

3.21 

2  .  81 

-3  .  84 

q.r.w.  -  current 

5 

.  67 

5  .  27 

5  .  74 

1.13 

Panel  B:   t-Statistics  on  Differences  in  Average  Squared  Error 


Quarter : 
Horizon  : 


1 
240 


2 
180 


3 
120 


4 
60 


q.a.r.  -  mean 

5 

08 

3 

16 

2 

73 

-0 

22 

q.a.r.  -  median 

4 

81 

3 

13 

2 

93 

-1 

04 

q.a.r.  -  current 

4 

64 

3 

47 

3 

79 

0 

87 

q.r.w.  -  mean 

4 

49 

3 

26 

3 

09 

-0 

38 

q.r.w.  -  median 

4 

22 

3 

23 

3 

29 

-1 

21 

q.r.w.  -  current 

4 

05 

3 

56 

4 

15 

0 

70 

Notes  : 


The  reported  numbers  are  t-statistics  on  pairwise  differences  in 
average  absolute  or  squared  forecast  error.   See  Table  3  for  the 
average  absolute  and  squared  errors.   The  computations  are  described 
in  equations  (9)  through  (11)  in  the  text. 

The  forecast  sources  are: 


a  .  r 


q 
q 


mean 

median 

current 


a  quarterly  au tor egr es s i ve  model  in  fourth  differences; 

equation  (1)  in  the  text. 

a  quarterly  random  walk  model  in  fourth  differences; 

equation  (2)  in  the  text. 

the  mean  of  the  available  analysts'  forecasts. 

the  median  of  the  available  analysts'  forecasts. 

the  most  recent  forecast  from  an  analyst. 


The  forecast  horizons  are  measured  in  trading  days  prior  to  the  annual 
earnings  announcement. 

The  degrees  of  freedom  for  all  reported  t-statistics  are  over  2,000, 
so  they  are  approximately  normal.   For  a  two-sided  test,  the  .05  and 
.01  critical  points  from  the  N(0,1)  distribution  are  1.96  and  2.58, 
respectively. 


Table  6 

Slope  Coefficients  from  the  Regression  of  EPS  Forecast  Error 
on  Excess  Return,  for  Five  Forecast  Horizons: 


e  .  _   =  a  ,  .  . 
1 j tx     1  1  JT 


a 


2i  tt 


6  .   U  .. 

IT    J  tT 


i  j  tT 


(  12 


Source  : 
q  .  a  .  r  . 


q  .  r  .  w  . 


mean 


median 


current 


Forecast  Horizon,  in  Trading  Days 
240        180        120        60         5 


0  .  98 
(5.38) 

0  .  29 
(1.61) 

0  .  14 
(0.78) 

0  .  07 
(0.40) 

0  .  32 
(1.75) 


0  .  96 
(5.72) 

0  .  24 
(1.44) 

0.01 
(0.04) 

0  .  07 
(0.40) 

0.29 
(1.75) 


0  .  99 
(6.46) 

0  .  63 

(4.15) 

0  .  34 
(2.20) 

0.41 
(2.68) 

0  .  53 
(3.49) 


0  .  48 
(3.26) 

0  .  18 

(  1  .25) 

0.14 
(0.96) 

0  .  20 

(  1  .  38) 

0  .  37 
(2.49) 


0  .  66 
(1 .68) 

0  .  77 
(  1  .97) 

0  .  25 
(0.64) 


Notes : 

The  forecast  sources  are: 

q.a.r.  -  a  quarterly  autoregressive  model  in  fourth  differences: 

equation  (1)  in  the  text. 
q.r.w.  -  a  random  walk  model  with  drift  in  fourth  differences: 
equation  (2)  in  the  text, 
mean  -  the  mean  of  the  available  analysts'  forecasts, 
median  -  the  median  of  the  available  analysts'  forecasts, 
current  -  the  most  recent  forecast  from  an  analyst. 

The  forecast  horizons  are  measured  in  trading  days  prior  to  the  annual 
earnings  announcement. 

The  variables  in  regression  equation  (12)  are  the  forecast  error  from 
forecast  source  i  for  firm  j  in  year  t  at  horizon  T,  e  j  j  t  T  ,  and  the 
cumulated  excess  returns  for  firm  j  in  year  t  over  horizon  t,  L'jtT. 
Equation  (12)  is  estimated  jointly  for  the  five  forecast  sources  and 
separately  for  each  of  the  forecast  horizons.   The  parameters  «jjjt 
are  firm-specific  effects,  and  the  parameters  t^jt-r  are  year-specific 
effects . 


A  slope  coefficient  8  j  T  is  estimated  for  each  forecast  source  i  and 
horizon  x.   The  statistical  significance  of  the  correlation  between 
forecast  errors  and  excess  returns  is  measured  by  the  t-statistic  on 
the  slope  coefficient,  which  is  reported  in  parentheses.  These  t- 
statistics  have  degrees  of  freedom  greater  than  1,000.   For  a  one- 
sided test,  the  .05  and  .01  critical  points  from  the  N(0,1) 
distribution  are  1.65  and  2.33,  respectively. 


Table  7 

Pairwise  Differences  in  Slope  Coefficients  from  the  Regression 
of  EPS  Forecast  Error  on  Excess  Return,  for  Five  Forecast  Horizons 


e  .  . .   =<x,..   +  an  .  .       +6.   U  . .   +  v  .  . 

ljtT      llJT      2ltT      IT    jtT      ljtT 


{  12 


Panel  A:   t-Statistics  on  Differences  between  Quarterly  Auto- 
regressive  Model  [eqn.  (1)]  and  Other  Forecast  Sources 


Quarter  : 
Horizon: 


1 

240 


2 
180 


3 
120 


4 
60 


q.a.r.  -  q.r.w. 

2  .  69 

3 

08 

1 

68 

1 

47 

q  .  a  .  r  .  -  mean 

3  .  28 

4 

08 

3 

09 

1 

68 

q.a.r.  -  median 

3  .  55 

3 

82 

2 

74 

1 

.  37 

q.a.r.  -  current 

2  .  59 

2 

85 

2 

16 

0 

56 

Panel  B:   t-Statistics  on  Differences  between  the  Most  Current 
and  the  Mean  or  Median  Analyst  Forecasts 


Horizon  : 

240 

180 

120 

60 

5 

mean 
median 

-  current 

-  current 

-0  .  69 
-0  .  96 

-1.23 
-0  .  97 

-0  .  94 
-0  .  59 

-1.12 
-0  .  82 

0  .  76 
0  .  97 

Notes  : 


The 


forecast 
q.a.r. 

q.r.w. 

mean 

median 

current 


sources  are: 

-  a  quarterly  autoregressive  model  in  fourth 
differences;  equation  (1)  in  the  text. 

-  a  random  walk  model  with  drift  in  fourth 
differences;  equation  (2)  in  the  text. 

-  the  mean  of  the  available  analysts'  forecasts. 

-  the  median  of  the  available  analysts'  forecasts 

-  the  most  recent  forecast  from  an  analyst. 


The  forecast  horizons  are  measured  in  trading  days  prior  to  the  annual 
earnings  announcement. 


The  variables  in  regression  equation  (12)  are  the  forecast  error  from 
forecast  source  i  for  firm  j  in  year  t  at  horizon  T,  e  i  j  t  T  ■  a  n  ^  tne 
cumulated  excess  returns  for  firm  j  in  year  t  over  horizon  x,  u  j  t  t  • 
Equation  (12)  is  estimated  jointly  for  the  five  forecast  sources  and 
separately  for  each  of  the  forecast  horizons.   The  parameters  «iijx 
are  f  i  rm- spec  i  f  i  c  effects,  and  the  parameters  ot2itx  are  year-specific 
effects . 


A  slope  coefficient  B j ^  is  estimated  for  each  forecast  source  i  and 
horizon  t.   The  slope  coefficients  are  reported  in  Table  6.   The  t- 
statistics  reported  in  this  table  test  differences  in  slope  across 
forecast  sources,  against  a  null  hypothesis  of  no  difference.   These 
t-statistics  have  degrees  of  freedom  greater  than  2,000.   For  a  one- 
sided test,  the  .05  and  .01  critical  points  from  the  N(0,1) 
distribution  are  1.65  and  2.33,  respectively. 


Table  8 

Regression  Summary  Statistics  for  the  Regression 
of  EPS  Forecast  Error  on  Excess  Return,  for  Five  Forecast  Horizons 


i  jtx 


«,-•   +a«-^   +6.   U  .    +  v  .  .  ^ 
Iijt     2ltT     n   jtt     ijtx 


(12) 


Forecast  Horizon,  in  Trading  Days 
240        180        120        60  5 


adjusted  R2 

full  model  F(k-1 , N-k) 

year-effect  F(kl  ,N-k ) 

firm-effect  F(k2,N-k) 

excess  returns  F(k3,N-k) 


.116 

.  095 

.  107 

.  114 

.  156 

5  .  02 

4  .  27 

4  .  79 

5  .  09 

4  .  60 

54  .  13 

36  .  79 

25.17 

19  .  25 

2  .  94 

3.34 

3  .  08 

3  .  87 

4  .  82 

4  .  65 

6  .  89 

7  .  33 

14.37 

3  .  57 

2  .  18 

5986 

6171 

6267 

6293 

3779 

199 

199 

199 

199 

195 

7 

7 

7 

7 

7 

184 

184 

184 

184 

184 

5 

5 

5 

5 

3 

d.f.  for  F-s ta t i s t i c s : 

sample  size     (N) 
#  of  parameters     (k) 

#  of  years  (kl+1) 

#  of  firms  (k2+l ) 
#  of  slopes    (k3) 


Notes  : 

The  variables  in  regression  equation  (12)  are  the  forecast  error  from 
forecast  source  i  for  firm  j  in  year  t  at  horizon  x,  ejjtT,  and  the 
cumulated  excess  returns  for  firm  j  in  year  t  over  horizon  x,  U  j  t  T  . 
Equation  (12)  is  estimated  jointly  for  the  five  forecast  sources  and 
separately  for  each  of  the  forecast  horizons.   The  parameters  otjjjT 
are  f i rm- spec i f i c  effects,  and  the  parameters  «2itx  are  year-specific 
effects . 

The  full  model  F-statistic  tests  the  null  hypothesis  that  the 
regression  model  (12)  has  explanatory  power.   The  year,  firm,  and 
excess  return  F-statistics  test  the  incremental  explanatory  power  of 
including  groups  of  parameters  in  the  model.   Selected  critical  points 
for  the  F  distribution  are: 


Numerator  d.f 

3 

5 

6 

120 


a 


05 


a  = 


001 


Denominator 

d  . 

120 

<x> 

2  .  68 

2 

60 

2  .  29 

2 

21 

2  .  18 

2 

10 

1  .  35 

1 

22 

Denominator  d 
120 


5 

78 

5 

42 

4 

42 

4 

10 

4 

04 

3 

.74 

1 

77 

1 

45 

Appendix 

In  this  appendix  I  present  results  for  two  alternate 
forecast  error  scales:  forecast  errors  scaled  by  the  average 
absolute  deviation  of  the  past  five  years'  changes  in  EPS,  which 
I  call  standardized  forecast  errors: 


se  .  .  . 
IJtT 


ijtx 


r-       AA  . 

E   I   js 

s  =  t-5 


(Al) 


and  forecast  errors  as  a  percent  of  the  absolute  value  of  the 
prior  year's  EPS,  or  percent  forecast  errors: 


pe 


ijtx 


eiJtT 
V-l 


(A2) 


There  are  two  reasons  for  considering  several  scales  for 
forecast  errors  in  this  study.   The  first  is  comparability  with 
previous  work.   Variants  of  (A2)  have  been  used  frequently  in 
previous  research  to  create  a  unitless  measure  (e.g.,  Cragg  and 
Malkiel  (1968),  Brown  and  Rozeff  (1978),  Beaver,  Clarke  and 
Wright  (1979),  Malkiel  and  Cragg  (1981),  and  Elton,  Gruber  and 
Gultekin  (1981)).   Equation  (A2)  is  appropriate  if  forecasts  of 
EPS  growth,  not  EPS,  are  relevant  for  decisions. 

The  second  reason  for  considering  alternate  scales  is  that 
EPS  forecast  errors  may  be  het er oscedas t i c .   That  is,  there  may 
be  differences  in  predictability  of  EPS  across  firms,  perhaps 
related  to  firm  size,  earnings  variability,  or  available 


information.   Equation  (Al)  scales  forecast  errors  by  the  average 
variation  in  previous  years'  EPS,  as  a  benchmark  for 
predictability.   The  average  variation  is  measured  here  by  the 
average  absolute  value  of  past  changes.   This  measure  is  less 
sensitive  to  outliers  than  the  standard  deviation,  and  describes 
the  variablility  of  a  random  walk  with  trend  when  loss  is 
proportional  to  absolute  error.   Alternatively,  if  predictability 
is  proportional  to  the  level  of  EPS,  then  equation  (A2)  controls 
for  cross-sectional  variation  in  predictability. 


Table  A2 
Forecast  Bias,  for  Five  Forecast  Sources,  at  Five  Forecast  Hor 

(  t  -  s t a t i s t i c s  are  in  parentheses) 


1  zon  s 


Pane  1  A : 

Standardized 
forecast  error 
[ eqn  .  A  1 ] 


Panel  B : 
Percent 
forecast  error 

[eqn  .  A2 ] 


Horizon 

240 

180 

120 

60 

5 

Source : 

q  .  a  .  r  . 

-0 

.  24 

-0 

,  12 

-0  .  20 

-0.11 

(-3 

.  69  ) 

(-2 

.41) 

(-4.70) 

(-3.59) 

q  .  r  .  w  . 

-0 

.  14 

-0 

,  03 

-0.15 

-0  .  08 

(-2 

.24) 

(-0 

.  66  ) 

(-3.57) 

(-2.56) 

mean 

-0  , 

17 

-0 

.  20 

-0.18 

-0.14 

-0 

.  07 

(-2 

.  58) 

(-4 

.  05  ) 

(-4.20) 

(-4  .35) 

(-2 

.  08  ) 

median 

-0  . 

09 

-0 

,  09 

-0  .  08 

-0  .  03 

0 

.  0  3 

(-1 

.  38  ) 

(-1 

.  89  ) 

(-1  .83) 

(-0.94) 

(1 

.  00  ) 

current 

-0  , 

18 

-0 

.  17 

-0.13 

-0  .  08 

-0 

04 

(-2 

.  84  ) 

(-3 

.  48  ) 

(-3.01) 

(-2.60) 

(-1 

.  36  ) 

q  .  a  .  r  . 

0 

0  5 

0 

.  05 

.  00 

-0  .  01 

(1 

.  90  ) 

(2 

■  14) 

(-0.31 ) 

(-0.69) 

q  .  r  .  w  . 

0 

06 

0 

.  07 

0  .  03 

0.01 

(2 

.  24  ) 

(3 

.  25  ) 

(1.78) 

(  1  .54) 

mean 

-0  . 

01 

-0 

.  02 

-0  .  02 

-0  .  02 

-0 

02 

(-0 

.52) 

(-o 

.  80  ) 

(-1.61) 

(-2.72) 

(-2 

.  55  ) 

median 

00 

.  00 

-0  .  01 

-0  .  01 

00 

(0 

.  07  ) 

(0 

.  05) 

(-0.37) 

(-0.70) 

(0 

.21) 

current 

-0  . 

01 

-0 

,  01 

-0  .  01 

-0.01 

-0 

01 

(-0 

.34) 

(-0 

.  47  ) 

(-0.94) 

(-1.31) 

(  -1 

.  11  ) 

Notes  : 


The  computation  of  forecast  bias  and  its  associated  t-statistic  are 
described  in  equations  (6)  through  (8)  in  the  text. 

The  forecast  sources  are: 

q.a.r.  -  a  quarterly  au toregress i ve  model  in  fourth  differences 

equation  (1)  in  the  text, 
q.r.w.  -  a  random  walk  model  with  drift  in  fourth  differences; 
equation  (2)  in  the  text, 
mean  -  the  mean  of  the  available  analysts'  forecasts, 
median  -  the  median  of  the  available  analysts'  forecasts, 
current  -  the  most  recent  forecast  from  an  analyst. 


The  forecast  horizons  are  measured  in  trading  days  prior  to  the  annual 
earnings  announcement. 


The  degrees  of  freedom  for  all  reported  t-statistics  are  over  1,000, 
so  they  are  approximately  normal.   For  a  two-sided  test,  the  .05  and 
.01  critical  points  of  the  N(0,1)  distribution  are  1.96  and  2.58, 
respectively. 


Table  A3 

Forecast  Accuracy: 

Average  Absolute  or  Squared  Forecast  Error 

for  Five  Forecast  Sources,  and  Five  Forecast  Horizons 


Panel  A:   Average  Absolute  Error 


Standardized 
forecast  error 
[ eqn .  Al ] 


Percent 

forecast  error 
[eqn .  A2 ] 


Source : 

q  .  a  .  r  . 

q  .  r  .  w  . 

mean 

median 

current 

q  .  a  .  r  . 

q  .  r  .  w  . 

mean 

median 

current 


240 

478 
494 
147 
208 
181 


0  .  337 

0  .  335 

0  .  244 

0  .  255 

0  .  242 


Horizon 

180 

120 

60 

5 

1 

.112 

0  .  867 

0  . 

505 

1 

.121 

0.919 

0 

.  523 

0 

.  990 

0  .  777 

0  . 

616 

0  , 

467 

1 

.  035 

0  .  833 

0 

.  677 

0 

.516 

0 

.  958 

0  .  733 

0 

555 

0 

492 

0 

.  273 

0.191 

0 

.115 

0 

.  273 

0.212 

0 

.  124 

0 

.212 

0  .  164 

0 

.121 

0 

.  090 

0 

.219 

0.173 

0 

132 

0 

096 

0 

.  202 

0.154 

0 

.  104 

0 

.  088 

Panel  B :   Avera 

g 
S 

e  Squared 

ource : 

Error 
240 

L80 

Horizon 
120 

60 

5 

q  .  a  .  r  . 

6  .  702 

3 

.  042 

2.719 

0 

788 

Standardized 

q  .  r  .  w  . 

6  .  482 

2 

.  950 

2  .  724 

0 

.  793 

forecast  error 

mean 

3  .  450 

2 

.  810 

2  .  066 

1 

,  458 

1 

.  083 

[ eqn .  Al ] 

median 

3.615 

2 

.  857 

2  .  035 

1 

.  529 

1 

.  146 

current 

4  .  766 

3 

.  245 

1  .  896 

1 

466 

1 

.  257 

q  .  a  .  r  . 

1  .  694 

0 

.914 

0  .  247 

0 

.  103 

Percent 

q  .  r  .  w  . 

1  .  479 

1 

.  107 

0  .  481 

0 

144 

forecast  error 

mean 

0  .  456 

0 

.314 

0.163 

0 

.  066 

0 

.  052 

[eqn .  A2] 

median 

0  .  455 

0 

.  326 

0.146 

0 

073 

0 

052 

current 

0  .  468 

0 

.  316 

0  .  206 

0 

.  045 

0 

.  037 

Notes  : 

The  computation  of  average  absolute  error  is  described  in  equations 
(9)  and  (10)  in  the  text.   The  computation  of  average  squared  error  is 
analogous  to  the  computation  of  average  absolute  error. 


The  forecast  sources  are: 

q.a.r.  -  a  quarterly  au toregress i ve  model  in  fourth  differences; 

equation  (1)  in  the  text, 
q.r.w.  -  a  random  walk  model  with  drift  in  fourth  differences; 
equation  (2)  in  the  text, 
mean  -  the  mean  of  the  available  analysts'  forecasts, 
median  -  the  median  of  the  available  analysts'  forecasts, 
current  -  the  most  recent  forecast  from  an  analyst. 


The  forecast  horizons  are  measured  in  trading  days  prior  to  the  annual 
earnings  announcement. 


Table  A4 

Pair wise  Differences  in  Forecast  Accuracy 
Among  the  Mean,  Median  and  Most  Current  Analyst  Forecasts 

for  Five  Forecast  Horizons 


Panel  A : 

t-Statistics  on 

Differences 

i  n  Avera 

ge  Abso 1 u t 

e  Error 

ependent 

H 

or  i  zon 

ar i  abl e : 

240 

180 

120 

60 

5 

td ' zed 

mean  -  median 

-0  .  83 

-0  .  85 

-1.12 

-1.70 

-1 

.  29 

rr  or 

mean  -  current 

-0  .  47 

0  .  60 

0  .  89 

1  .  68 

-0  . 

6  5 

e  q  n  .  A 1  ] 

median  -  current 

0  .  36 

1  .  44 

2  .  01 

3  .  38 

0 

.  64 

ercen t 

mean  -  median 

-0  .  33 

-0  .  28 

-0  .  50 

-1.12 

-0 

82 

rror 

mean  -  current 

0  .  07 

0  .  35 

0  .  58 

1  .  77 

0 

.  29 

eqn .  A2] 

median  -  current 

0  .  40 

0  .  64 

1  .  08 

2  .  89 

1 

,  12 

Panel  B : 

t-Statistics  on 

Differences 

in  Avera 

ge  Squared 

Error 

ependent 

Horizon 

:      240 

180 

120 

60 

5 

ar  i  ab 1 e : 

td ' zed 

mean  -  median 

-0.15 

-0  .  07 

0  .  05 

-0.15 

-0 

.  12 

rror 

mean  -  current 

-1.22 

-0  .  64 

0  .  26 

-0  .  02 

-0 

.  33 

eqn .  Al  ] 

median  -  current 

-1  .  06 

-0  .  57 

0.21 

0.13 

-0 

.  21 

er cen t 

mean  -  median 

.  00 

-0  .  02 

0  .  09 

-0.16 

0 

.  02 

rror 

mean  -  current 

-0  .  01 

.  00 

-0.25 

0.46 

0 

,  76 

eqn .  A2  ] 

median  -  current 

-0.01 

0  .02 

-0  .  34 

0  .  61 

0 

.  74 

Notes  : 

The  reported  numbers  are  t-statistics  on  pairwise  differences  in 
average  absolute  or  squared  forecast  error.   See  Table  A3  for  the 
average  absolute  and  squared  errors.   The  computations  are  described 
in  equations  (9)  through  (11)  in  the  text. 

The  degrees  of  freedom  for  all  reported  t-statistics  are  over  2,000, 
so  they  are  approximately  normal.   For  a  two-sided  test,  the  .05  and 
.01  critical  points  from  the  N(0,1)  distribution  are  1.96  and  2.5  8, 
respectively. 

The  forecast  sources  are: 

mean  -  the  mean  of  the  available  analysts'  forecasts, 
median  -  the  median  of  the  available  analysts'  forecasts, 
current  -  the  most  recent  forecast  from  an  analyst. 


The  forecast  horizons  are  measured 
earnings  announcement. 


in  trading  days  prior  to  the  annual 


Table  A5 

Pairwise  Differences  in  Forecast  Accuracy  Between  Analysts  and 
Quarterly  Time-Series  Models,  for  Five  Forecast  Horizons 


Panel  A 


t-Statistics  on  Differences  in  Average  Absolute  Error 


Dependent 
Var  i  abl e : 

Std  '  zed 

error 

[ eqn .  Al ] 


Quarter 
Horizon 


q.a.r.  -  mean 

q.a.r.  -  median 

q.a.r.  -  current 

q.r.w.  -  mean 

q.r.w.  -  median 

q.r.w.  -  current 


1 
240 


4  .  54 
3.71 
4  .  07 

4  .  75 

3  .  92 

4  .  29 


2 
180 


2  .  28 

1  .  43 

2  .  88 

2  .  45 
1.61 

3  .  05 


3 
120 


1  .  82 

0  .  70 
2.71 

2  .  86 

1  .  74 

3  .  75 


4 

60 

3 

06 

4 

.  76 

1  , 

.  38 

2 

.  57 

4 

.  27 

0 

.  89 

Percent        q.a.r.  -  mean  2.74  2.34 

error        q.a.r.  -  median  2.41  2.06 

[eqn.  A2]   q.a.r.  -  current  2.81  2.69 

q.r.w.  -  mean  2.68  2.33 

q.r.w.  -  median  2.35  2.04 

q.r.w.  -  current  2.75  2.68 


1  .  55 


0  .  67 


1 

.  05 

2 

13 

2 

.  80 

2  . 

29 

3 

.  37 

1 

.  79 

1  . 

1  1 

0 

.  32 

0. 

80 

2 

.  10 

Panel  B:   t-Statistics  on  Differences  in  Average  Squared  Error 


Dependent 
Variable: 


Quarter  : 
Horizon : 


1 
240 


2 
180 


3 
120 


4 
60 


Std'zed  q.a.r.  -  mean  3.00  0.34 
error  q.a.r.  -  median  2.85  0.27 
[eqn.  Al]   q.a.r.  -  current      1.79     -0.30 


0  .  99 

1  .  04 
1  .  25 


-1.38 
-1  .  53 
-1  .  40 


q.r.w.  -  mean      2.80      0.21      1.00 

q.r.w.  -  median      2.65      0.14       1.05 

q.r.w.  -  current       1.59     -0.43      1.26 


-1  .  37 
-1.52 
-1.39 


Percent  q.a.r.  -  mean  1.30  1.11  0.48  0.81 
error  q.a.r.  -  median  1.30  1.09  0.57  0.65 
[eqn.  A2]   q.a.r.  -  current      1.29      1.10      0.23      1.26 


q.r.w.  -  mean 

q.r.w.  -  median 

q.r.w.  -  current 


1  .  08 
1  .  08 
1  .  06 


1  .  46 
1  .  44 
1  .  46 


1.81 
1  .  90 
1  .  56 


71 
55 

17 


Notes  : 

The  reported  numbers  are  t-statistics  on  pairwise  differences  in 
average  absolute  or  squared  forecast  error.   See  Table  3  for  the 
average  absolute  and  squared  errors.   The  computations  are  described 
in  equations  (9)  through  (11)  in  the  text. 


The  forecast 

q  .  a  .  r  .  - 

q  .  r  .  w  .  - 

mean  - 

med  ian  - 

current  - 


sources  are: 

-  a  quarterly  autoregressive  model  in  fourth  differences: 

equation  (1)  in  the  text. 

a  quarterly  random  walk  model  in  fourth  differences; 

equation  (2)  in  the  text. 

the  mean  of  the  available  analysts'  forecasts. 

the  median  of  the  available  analysts'  forecasts. 

the  most  recent  forecast  from  an  analyst. 


The  forecast  horizons  are  measured  in  trading  days  prior  to  the  annual 
earnings  announcement. 


The  degrees  of  freedom  for  all  reported  t-statistics  are  over  2,000, 
so  they  are  approximately  normal.   For  a  two-sided  test,  the  .05  and 
.01  critical  points  from  the  N(0,1)  distribution  are  1.96  and  2.58, 
respectively. 


Table  A6 


Slope  Coefficients  from  the  Regression  of  EPS  Forecast  Error 
on  Excess  Return,  for  Five  Forecast  Horizons: 


Panel  A 


e  .  _   =  oc.,  .  . 
1 jtT     1  a jt 


a 


U,  .   +  v,  _ 

ax   jtx     1 j tx 


(12 


'2itT 
Standardized  forecast  error  [eqn.  Al]  as  dependent  variable 


Horizon 

140 

180 

120 

60 

5 

Source  : 

q  .  a  .  r  . 

0 

9  5 

1 

15 

1  .  25 

0  .  55 

(3 

.  18) 

(4 

68  ) 

(5.00) 

(2.25) 

q  .  r  .  w  . 

0 

48 

0 

33 

0  .  78 

0.17 

(1 

.61) 

(  1 

33  ) 

(3.11) 

(0.68) 

mean 

0 

55 

0 

07 

0  .  45 

0.10 

0  .  49 

(1 

.  85) 

(0 

29  ) 

(  1  .80) 

(0.40) 

(0.67) 

median 

0 

49 

0 

14 

0  .  53 

0.21 

0  .  88 

( 1 

.  65) 

(0 

55  ) 

(2.14) 

(0.84) 

(1.21) 

current 

0 

73 

0 

18 

0  .  76 

0  .  43 

-0  .  88 

(2 

.  44  ) 

(0 

72  ) 

(3.04) 

( 1 .75) 

(-1.21) 

Panel  B:   Percent  forecast  error  [eqn.  A2]  as  dependent  variable 


Source : 
q  .  a  .  r 


q  .  r  .  w 


mean 


median 


current 


240 

0  .  34 
(2.74) 

0.18 

(1  .47) 

0.12 
(0.95) 

0.11 
( 0 . 88  ) 

0.16 
(1.34) 


180 

0  .  38 
( 3 . 58) 

0.15 
(1.42) 

0  .  05 
(0.49) 

0  .  06 
( 0 . 53) 

0.10 
(0.96) 


Horizon 
120 

0  .  35 
(4  .26) 

0  .  27 
(3.23) 

0.15 
(1  .84) 

0.16 
(1  .92) 

0  .  22 
(2.65) 


60 

0.15 
(2.36) 

-0  .  01 
(-0.15) 

0  .  02 

(0.25) 

0  .  03 
(0.47) 

0.11 
( 1 .83) 


-0  .  00 
(-0.01 ) 

0.10 
(0.72) 

0  .  02 
(0.14) 


Notes: 

The  forecast  sources  are: 

q.a.r.  -  a  quarterly  au tor egress i ve  model  In  fourth  differences; 

equation  (1)  in  the  text, 
q.r.w.  -  a  random  walk  model  with  drift  in  fourth  differences: 
equation  (  2  )  in  the  text, 
mean  -  the  mean  of  the  available  analysts'  forecasts, 
median  -  the  median  of  the  available  analysts'  forecasts, 
current  -  the  most  recent  forecast  from  an  analyst. 

The  forecast  horizons  are  measured  in  trading  days  prior  to  the  annual 
earnings  announcement. 


A  slope  coefficient  & ±j    is  estimated  for  each  forecast  source  i  and 
horizon  T.   The  statistical  significance  of  the  correlation  between 
forecast  errors  and  excess  returns  is  measured  by  the  t-statistic  on 
the  slope  coefficient,  which  is  reported  in  parentheses.  These  t- 
statistics  have  degrees  of  freedom  greater  than  1,000.   For  a  one- 
sided test,  the  .05  and  .01  critical  points  from  the  N(0,1) 
distribution  are  1.65  and  2.33,  respectively. 


Table  A7 


Pairwise  Differences  in  Slope  Coefficients  from  the  Regression 
of  EPS  Forecast  Error  on  Excess  Return,  for  Five  Forecast  Horizons: 


e  .  . . 
1  J  tT 


lij'T 


+  a 


2i  tt 


B  .   U  .. 
IT    jtT 


+  v  •  •  *. 
1  J  tT 


(12) 


Panel  A:   t-Statistics  on  Differences  between  Quarterly  Au tor egress i ve 
Model  [eqn.  (1)]  and  Other  Forecast  Sources 


Dependent 
Var  i  ab 1 e  : 

Std ' zed 
error 
[eqn.  A 1 ] 

Percent 

error 

[ eqn .  A2 ] 


q  .  a  .  r  . 
q  .  a  .  r 
q  .  a  .  r  . 
q  .  a  .  r  .  - 

q  .  a  .  r  . 
q  .  a  .  r 
q  .  a  .  r  . 
q  .  a  .  r  .  - 


Quarter : 

1 

2 

3 

4 

Horizon : 

240 

180 

120 

60 

-  q  .  r  .  w  . 

1.12 

2.41 

1  .  37 

1.15 

-  mean 

0  .  95 

3.15 

2  .  32 

1  .  35 

median 

1  .  09 

2  .  97 

2  .  08 

1  .  03 

current 

0  .  53 

2  .  85 

1  .  43 

0  .  36 

-  q  .  r  .  w  . 

0.91 

1  .  55 

0  .  74 

1  .  84 

-  mean 

1  .  28 

2  .  22 

1  .  75 

1  .  54 

median 

1  .  33 

2.19 

1  .  70 

1  .  38 

current 

1  .  00 

1  .  88 

1.17 

0  .  39 

Panel  B:   t-Statistics  on  Differences  between  the  Most  Current  and 
the  Mean  or  Median  Analyst  Forecasts 


Dependent 
Variable: 


Horizon 


240 


180 


120 


60 


Std ' zed 

mean  - 

-  current 

-0 

42 

-0 

30 

-0 

90 

-0 

99 

1 

37 

error 

median 

-  current 

-0 

56 

-0 

12 

-0 

65 

-0 

67 

1 

77 

Percent 

mean  - 

current 

-0 

28 

-0 

34 

-0 

58 

-1 

15 

-0 

1  1 

error 

med  i  an  ■ 

-  current 

-0 

33 

-0 

31 

-0 

52 

-0 

99 

0 

.42 

Notes  : 

The  forecast  sources  are: 

q.a.r.  -  a  quarterly  au tor egress i ve  model  in  fourth 

differences:  equation  (1)  in  the  text, 
q.r.w.  -  a  random  walk  model  with  drift  in  fourth 
differences;  equation  (2)  in  the  text, 
mean  -  the  mean  of  the  available  analysts'  forecasts, 
median  -  the  median  of  the  available  analysts'  forecasts, 
current  -  the  most  recent  forecast  from  an  analyst. 

The  forecast  horizons  are  measured  in  trading  days  prior  to  the  annual 
earnings  announcement. 


A  slope  coefficient  BjT  is  estimated  for  each  forecast  source  i  and 
horizon  T.   The  slope  coefficients  are  reported  in  Table  6.   The  t - 
statistics  reported  in  this  table  test  differences  in  slope  across 
forecast  sources,  against  a  null  hypothesis  of  no  difference.   These 


t-statistics  have  degrees  of  freedom  greater  than  2.000.   For 
sided  test,  the  .05  and  .01  critical  points  from  the  X  (  0  ,  1  ) 
distribution  are  1.65  and  2.33,  respectively. 


a  one 


t-statistics  have  degrees  of  freedom  greater  than  2,000.   For 
sided  test,  the  .05  and  .01  critical  points  from  the  N(0,1) 
distribution  are  1.65  and  2.33,  respectively. 


one 


Table  A8 


Regression  Summary  Statistics  for  the  Regression 
of  EPS  Forecast  Error  on  Excess  Return,  for  Five  Forecast  Horizons 


Dependent 
Var  iabl e : 

Std ' zed 

error 

[ eqn .  Al ] 


Percent 
error 
[eqn .  A2 ] 


eijtt 


li  JT 


a2itT  +  Bit  UjtX 


240 


R2 

F(k-1 ,N~k) 

k 

N 

.  139 

5  .  80 

195 

5783 

R2 

F(k-1  ,N-k) 

k 

N 

.  104 

4  .  49 

199 

5928 

180 

.  138 

5  .  93 

195 

5966 


.0  99 

4  .  37 

199 

6111 


Vijtl 


Horizon 
120 

.  129 

5.61 

195 

6057 


.  097 

4  .  35 

199 

6202 


60 

.  144 

6  .  25 

195 

6083 


.  088 

4  .  02 

199 

6228 


12) 


.  154 

4.51 

191 

3653 


.  138 

4  .  08 

195 

3740 


Notes 


Selected  critical  points 

for  the  F  distribution  are: 

ot_  =  _^001 
F( 120 , 120 )         1.77 
F( 120 ,  «  )         1.45 


Table  A9 

Summary  of  Regression  Results: 
Incremental  F-statistics  for  Groups  of  Parameters, 
in  the  Regression  of  EPS  Forecast  Error  on  Excess  Return 


Dependent 
Variable  : 

S  t  d  '  z  e  d 

error 

[ eqn .  Al ] 


240 

180 

Horizon 
120 

60 

year 

F(kl  ,  N 
kl 

-k) 

48  .  82 
6 

47  .  56 
6 

25  .  77 
6 

19  .  04 
6 

3.47 
6 

firm 

F  (k2  .  N- 
k2 

-k) 

4  .  36 
179 

4  .  54 
179 

4  .  86 
179 

6.10 
179 

4.61 
179 

excess 
return 

F(k3  ,  N- 
k3 

-k) 

4.71 
5 

4  .  74 
5 

8  .  89 
5 

1  .  64 
5 

1  .  19 
3 

N-k 


5588 


5771 


5862 


5888 


34  6  2 


Percent 
error 
[eqn .  A2] 


year 

F(kl  ,N-k  ) 
kl 

18.94 
6 

15.9 
6 

18.21 
6 

15.47 
6 

2.71 
6 

firm 

F(k2 , N-k ) 
k2 

4  .  08 
183 

3  .  98 
183 

3  .  83 
183 

3  .  74 
183 

4.19 
183 

excess 
return 

F(k3 , N-k) 
k3 

2  .  51 
5 

3  .  07 
5 

7  .  28 

5 

1  .  72 
5 

0.18 

o 

o 

N-k 


5729 


5912 


6003 


6029 


3545 


Notes  : 

The  full  model  F-statistic  tests  the  null  hypothesis  that  the 
regression  model  (12)  has  explanatory  power.   The  year,  firm,  and 
excess  return  F-statistics  test  the  incremental  explanatory  power  of 
including  groups  of  parameters  in  the  model.   Selected  critical  points 
for  the  F  distribution  are: 


Numerator  d.f. 

3 

5 

6 

120 


a  = 

05 

a  = 

.  001 

Denominat 

or 

d.f. 

Denominator  d. 

120 

00 

120 

00 

2  .  68 

2 

60 

5  .  78 

5  .  42 

2  .  29 

2 

21 

4  .  42 

4.10 

2.18 

2 

10 

4  .  04 

3  .  74 

1  .  35 

1 

22 

1  .  77 

1.45 
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